WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

C12N 15/12, C07K 14/705, C12Q 1/68, 
A61K 38/17, C07K 16/28, 14/72 



A2 



(11) International Publication Number: WO 00/15793 

(43) International Publication Date: 23 March 2000 (23.03.00) 



(21) International Application Number: PCT/US99/20958

(22) International Filing Date: 17 September 1999 (17.09.99) 



(30) Priority Data:
09/156,513          17 September 1998 (17.09.98)          US



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Application 

US 09/156,513 (CIP)

Filed on 17 September 1998 (17.09.98) 



(71) Applicant (for all designated States except US): INCYTE
PHARMACEUTICALS, INC. [US/US]; 3174 Porter Drive,
Palo Alto, CA 94304 (US).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BANDMAN, Olga 
[US/US]; 366 Anna Avenue, Mountain View, CA 94043 
(US). LAL, Preeti [IN/US]; 2382 Lass Drive, Santa Clara, 
CA 95054 (US). TANG, Y., Tom [CN/US]; 4230 Ranwick 
Court, San Jose, CA 95118 (US). CORLEY, Neil, C.
[US/US]; 1240 Dale Avenue #30, Mountain View, CA
94040 (US). GUEGLER, Karl, J. [CH/US]; 1048 Oakland
Avenue, Menlo Park, CA 94025 (US). GORGONE, Gina,
A. [US/US]; 1253 Pinecrest Drive, San Francisco, CA
94132 (US). BAUGHN, Marian, R. [US/US]; 14244 
Santiago Road, San Leandro, CA 94577 (US). 

(74) Agents: BILLINGS, Lucy, J. et al.; Incyte Pharmaceuticals, 
Inc., 3174 Porter Drive, Palo Alto, CA 94304 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GD, 
GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, 
KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW,
ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, TZ,
UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD,
RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK,
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI
patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR,
NE, SN, TD, TG).



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: HUMAN GPCR PROTEINS 
(57) Abstract 

The invention provides human GPCR proteins (HGPRP) and polynucleotides which identify and encode HGPRP. The invention also 
provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also provides methods for diagnosing, treating
or preventing disorders associated with expression of HGPRP. 



Applicants: John A. Salon, et al. 
Serial No.: 10/825,581 
Filed: April 15,2004 

For: DNA Encoding A Human Melanin 
Concentrating Hormone Receptor (MCH1) And 
Uses Thereof EXHIBIT 5 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL  Albania                   ES  Spain                LS  Lesotho                SI  Slovenia
AM  Armenia                   FI  Finland              LT  Lithuania              SK  Slovakia
AT  Austria                   FR  France               LU  Luxembourg             SN  Senegal
AU  Australia                 GA  Gabon                LV  Latvia                 SZ  Swaziland
AZ  Azerbaijan                GB  United Kingdom       MC  Monaco                 TD  Chad
BA  Bosnia and Herzegovina    GE  Georgia              MD  Republic of Moldova    TG  Togo
BB  Barbados                  GH  Ghana                MG  Madagascar             TJ  Tajikistan
BE  Belgium                   GN  Guinea               MK  The former Yugoslav    TM  Turkmenistan
BF  Burkina Faso              GR  Greece                   Republic of Macedonia  TR  Turkey
BG  Bulgaria                  HU  Hungary              ML  Mali                   TT  Trinidad and Tobago
BJ  Benin                     IE  Ireland              MN  Mongolia               UA  Ukraine
BR  Brazil                    IL  Israel               MR  Mauritania             UG  Uganda
BY  Belarus                   IS  Iceland              MW  Malawi                 US  United States of America
CA  Canada                    IT  Italy                MX  Mexico                 UZ  Uzbekistan
CF  Central African Republic  JP  Japan                NE  Niger                  VN  Viet Nam
CG  Congo                     KE  Kenya                NL  Netherlands            YU  Yugoslavia
CH  Switzerland               KG  Kyrgyzstan           NO  Norway                 ZW  Zimbabwe
CI  Côte d'Ivoire             KP  Democratic People's  NZ  New Zealand
CM  Cameroon                      Republic of Korea    PL  Poland
CN  China                     KR  Republic of Korea    PT  Portugal
CU  Cuba                      KZ  Kazakstan            RO  Romania
CZ  Czech Republic            LC  Saint Lucia          RU  Russian Federation
DE  Germany                   LI  Liechtenstein        SD  Sudan
DK  Denmark                   LK  Sri Lanka            SE  Sweden
EE  Estonia                   LR  Liberia              SG  Singapore







WO 00/15793 



PCT/US99/20958 



HUMAN GPCR PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human GPCR proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative, 
neurological, and immune disorders.

BACKGROUND OF THE INVENTION 

The term receptor describes proteins that specifically recognize other molecules. The 
category is broad and includes proteins with a variety of functions. The bulk of the proteins
termed receptors are cell surface proteins which bind extracellular ligands, leading to cellular
responses including growth, differentiation, endocytosis, and immune response. Other proteins 
termed receptors facilitate the specific transport of proteins across the endoplasmic reticulum 
membrane and localize enzymes to a particular location in the cell. 

G protein coupled receptors (GPCRs) are a superfamily of integral membrane proteins
which transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid
mediators of inflammation, peptide hormones, and sensory signal mediators. The GPCR becomes
activated when the receptor binds its extracellular ligand. Conformational changes in the GPCR, 
which result from the ligand-receptor interaction, affect the binding affinity of a G protein to the 
GPCR intracellular domains. This enables GTP to bind with enhanced affinity to the G protein. 
Activation of the G protein by GTP leads to the interaction of the G protein α subunit with
adenylate cyclase or other second messenger molecule generators. This interaction regulates the
activity of adenylate cyclase and hence production of a second messenger molecule, cAMP.
cAMP regulates phosphorylation and activation of other intracellular proteins. Alternatively,
cellular levels of other second messenger molecules, such as cGMP or eicosanoids, may be
upregulated or downregulated by the activity of GPCRs. The G protein α subunit is deactivated by
hydrolysis of the GTP by GTPase, and the β, γ, and α subunits reassociate. The heterotrimeric G
protein then dissociates from the adenylate cyclase or other second messenger molecule generator. 
Activity of GPCR may also be regulated by phosphorylation of the intra- and extracellular 
domains or loops. 

Visual excitation and the phototransmission of light signals is a signaling cascade in which
GPCRs play an important role. The process begins in retinal rod cells with the absorption of light
by the photoreceptor rhodopsin, a GPCR composed of a 40-kDa protein, opsin, and a
chromophore, 11-cis-retinal. The photoisomerization of the retinal chromophore causes a
conformational change in the opsin GPCR and activation of the associated G-protein, transducin.
This activation leads to the hydrolysis of cyclic-GMP and the closure of cyclic-GMP regulated,
Ca²⁺-specific channels in the plasma membrane of the rod cell. The resultant membrane
hyperpolarization generates a nerve signal. Recovery of the dark state of the rod cell involves the
activation of guanylate cyclase leading to increased cyclic-GMP levels and the reopening of the
Ca²⁺-specific channels (L. Stryer (1991) J. Biol. Chem. 266:10711-10714).

Glutamate receptors form a group of GPCRs that are important in neurotransmission. 
Glutamate is the major neurotransmitter in the CNS and is believed to have important roles in 
neuronal plasticity, cognition, memory, learning and some neurological disorders such as epilepsy,
stroke, and neurodegeneration (Watson, S. and S. Arkinstall (1994) The G-Protein Linked
Receptor Facts Book, Academic Press, San Diego CA, pp. 130-132). These effects of glutamate
are mediated by two distinct classes of receptors termed ionotropic and metabotropic. Ionotropic 
receptors contain an intrinsic cation channel and mediate fast, excitatory actions of glutamate. 
Metabotropic receptors are modulatory, increasing the membrane excitability of neurons by
inhibiting calcium dependent potassium conductances and both inhibiting and potentiating
excitatory transmission of ionotropic receptors. Metabotropic receptors are classified into five 
subtypes based on agonist pharmacology and signal transduction pathways and are widely 
distributed in brain tissues. 

The vasoactive intestinal polypeptide (VIP) family is a group of related polypeptides
whose actions are also mediated by GPCRs. Key members of this family are VIP itself, secretin,
and growth hormone releasing factor (GRF). VIP has a wide profile of physiological actions 
including relaxation of smooth muscles, stimulation or inhibition of secretion in various tissues, 
modulation of various immune cell activities, and various excitatory and inhibitory activities in the 
CNS. Secretin stimulates secretion of enzymes and ions in the pancreas and intestine and is also
present in small amounts in the brain. GRF is an important neuroendocrine agent regulating
synthesis and release of growth hormone from the anterior pituitary (Watson, S. and S. Arkinstall
supra, pp. 278-283).

The structure of GPCRs is highly conserved and consists of seven hydrophobic
transmembrane (serpentine) regions, cysteine disulfide bridges between the second and third
extracellular loops, an extracellular N-terminus, and a cytoplasmic C-terminus. Three
extracellular loops alternate with three intracellular loops to link the seven transmembrane regions.
The most conserved parts of these proteins are the transmembrane regions and the first two
cytoplasmic loops. A conserved, acidic-Arg-aromatic residue triplet present in the second
cytoplasmic loop may interact with the G-proteins. The consensus pattern of the G-protein
coupled receptors signature (PS00237; SWISSPROT) is characteristic of most proteins belonging
to this superfamily (Watson, S. and S. Arkinstall supra, pp. 2-6).
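
The acidic-Arg-aromatic triplet described above can be illustrated with a short sequence scan. The following is a minimal sketch and not part of the original disclosure: it assumes that "acidic" means Asp (D) or Glu (E) and that "aromatic" means Phe (F), Trp (W), or Tyr (Y) in one-letter code, and the function name and the example peptide are hypothetical. It does not reproduce the full PS00237 consensus pattern.

```python
import re

# Assumption: acidic = D/E, aromatic = F/W/Y (one-letter amino acid codes).
ACIDIC_ARG_AROMATIC = re.compile(r"[DE]R[FWY]")


def find_acidic_arg_aromatic(sequence):
    """Return (1-based start position, triplet) for every match in the sequence."""
    sequence = sequence.upper()
    return [(m.start() + 1, m.group(0))
            for m in ACIDIC_ARG_AROMATIC.finditer(sequence)]


# Hypothetical peptide, used only to exercise the scan.
hits = find_acidic_arg_aromatic("MKTDRYLAIVERWAA")
```

A hit only flags a candidate triplet; whether it lies in the second cytoplasmic loop would have to be established separately, for example from a transmembrane topology prediction.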

The discovery of new human GPCR proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of cell proliferative, neurological, and immune disorders.

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, human GPCR proteins, referred 
to collectively as "HGPRP". In one aspect, the invention provides a substantially purified
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID
NO: 1-6, and fragments thereof. 

The invention further provides a substantially purified variant having at least 90% amino 
acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ 
ID NO: 1-6, and fragments thereof. The invention also provides an isolated and purified
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO: 1-6, and fragments thereof. The invention also includes an 
isolated and purified polynucleotide variant having at least 70% polynucleotide sequence identity 
to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-6, and fragments thereof. 

Additionally, the invention provides an isolated and purified polynucleotide which
hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-6, and fragments
thereof. The invention also provides an isolated and purified polynucleotide having a sequence
which is complementary to the polynucleotide encoding the polypeptide comprising the amino
acid sequence selected from the group consisting of SEQ ID NO: 1-6, and fragments thereof.

The invention also provides an isolated and purified polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO:7-12, and fragments
thereof. The invention further provides an isolated and purified polynucleotide variant having at
least 70% polynucleotide sequence identity to the polynucleotide sequence selected from the
group consisting of SEQ ID NO:7-12, and fragments thereof. The invention also provides an
isolated and purified polynucleotide having a sequence which is complementary to the
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ
ID NO:7-12, and fragments thereof.

The invention also provides a method for detecting a polynucleotide in a sample
containing nucleic acids, the method comprising the steps of: (a) hybridizing the complement of
the polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of a polynucleotide in the sample. In one
aspect, the method further comprises amplifying the polynucleotide prior to hybridization.

The invention further provides an expression vector containing at least a fragment of the
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO: 1-6, and fragments thereof. In another aspect, the expression
vector is contained within a host cell.

The invention also provides a method for producing a polypeptide, the method comprising 
the steps of: (a) culturing the host cell containing an expression vector containing at least a
fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and
(b) recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a substantially 
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-6, and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention further includes a purified antibody which binds to a polypeptide selected 
from the group consisting of SEQ ID NO: 1-6, and fragments thereof. The invention also provides 
a purified agonist and a purified antagonist to the polypeptide. 

The invention also provides a method for treating or preventing a disorder associated with 
decreased expression or activity of HGPRP, the method comprising administering to a subject in
need of such treatment an effective amount of a pharmaceutical composition comprising a 
substantially purified polypeptide having the amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-6, and fragments thereof, in conjunction with a suitable 
pharmaceutical carrier. 

The invention also provides a method for treating or preventing a disorder associated with
increased expression or activity of HGPRP, the method comprising administering to a subject in
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-6, and fragments thereof. 

BRIEF DESCRIPTION OF THE TABLES

Table 1 shows nucleotide and polypeptide sequence identification numbers (SEQ ID NO), 
clone identification numbers (clone ID), cDNA libraries, and cDNA fragments used to assemble 
full-length sequences encoding HGPRP. 

Table 2 shows features of each polypeptide sequence including potential motifs,
homologous sequences, and methods and algorithms used for identification of HGPRP.

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as
determined by northern analysis, conditions, diseases or disorders associated with these tissues,
and the vector into which each cDNA was cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which Incyte 
clones encoding HGPRP were isolated.

Table 5 shows the programs, their descriptions, references, and threshold parameters used 
to analyze HGPRP. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is
understood that this invention is not limited to the particular machines, materials and methods
described, as these may vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that, as used herein and in the appended claims, the singular forms "a,"
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled 
in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same
meanings as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although any machines, materials, and methods similar or equivalent to those described
herein can be used to practice or test the present invention, the preferred machines, materials and
methods are now described. All publications mentioned herein are cited for the purpose of
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the
publications and which might be used in connection with the invention. Nothing herein is to be
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of
prior invention.

DEFINITIONS

"HGPRP" refers to the amino acid sequences of substantially purified HGPRP obtained
from any species, particularly a mammalian species, including bovine, ovine, porcine, murine,
equine, and preferably the human species, from any source, whether natural, synthetic, 
semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which, when bound to HGPRP, increases or
prolongs the duration of the effect of HGPRP. Agonists may include proteins, nucleic acids,
carbohydrates, or any other molecules which bind to and modulate the effect of HGPRP. 

An "allelic variant" is an alternative form of the gene encoding HGPRP. Allelic variants 
may result from at least one mutation in the nucleic acid sequence and may result in altered 
mRNAs or in polypeptides whose structure or function may or may not be altered. Any given
natural or recombinant gene may have none, one, or many allelic forms. Common mutational 
changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or 
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding HGPRP include those sequences with
deletions, insertions, or substitutions of different nucleotides, resulting in a polynucleotide the
same as HGPRP or a polypeptide with at least one functional characteristic of HGPRP. Included
within this definition are polymorphisms which may or may not be readily detectable using a
particular oligonucleotide probe of the polynucleotide encoding HGPRP, and improper or
unexpected hybridization to allelic variants, with a locus other than the normal chromosomal locus
for the polynucleotide sequence encoding HGPRP. The encoded protein may also be "altered,"
and may contain deletions, insertions, or substitutions of amino acid residues which produce a
silent change and result in a functionally equivalent HGPRP. Deliberate amino acid substitutions
may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity,
hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or
immunological activity of HGPRP is retained. For example, negatively charged amino acids may
include aspartic acid and glutamic acid, positively charged amino acids may include lysine and
arginine, and amino acids with uncharged polar head groups having similar hydrophilicity values
may include leucine, isoleucine, and valine; glycine and alanine; asparagine and glutamine; serine
and threonine; and phenylalanine and tyrosine.
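
The substitution groups listed in the preceding definition can be written down as a small lookup. This is an illustrative sketch only: the grouping is copied from the examples in the text, while the one-letter codes, the `SUBSTITUTION_GROUPS` name, and the `is_listed_substitution` helper are assumptions introduced here. Whether a given substitution actually preserves the biological or immunological activity of HGPRP is an experimental question that this check cannot answer.

```python
# Groups taken from the examples in the definition above (one-letter codes).
SUBSTITUTION_GROUPS = [
    frozenset("DE"),   # negatively charged: aspartic acid, glutamic acid
    frozenset("KR"),   # positively charged: lysine, arginine
    frozenset("LIV"),  # leucine, isoleucine, valine
    frozenset("GA"),   # glycine, alanine
    frozenset("NQ"),   # asparagine, glutamine
    frozenset("ST"),   # serine, threonine
    frozenset("FY"),   # phenylalanine, tyrosine
]


def is_listed_substitution(original, replacement):
    """True if two different residues fall within the same listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residue, not a substitution
    return any(a in group and b in group for group in SUBSTITUTION_GROUPS)
```

For example, `is_listed_substitution("D", "E")` is true because both residues are in the negatively charged group, whereas `is_listed_substitution("D", "K")` is false.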

The terms "amino acid" or "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic 
fragments" refer to fragments of HGPRP which are preferably at least 5 to about 15 amino acids in
length, most preferably at least 14 amino acids, and which retain some biological activity or
immunological activity of HGPRP. Where "amino acid sequence" is recited to refer to an amino 
acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are 
not meant to limit the amino acid sequence to the complete native amino acid sequence associated 
with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which, when bound to HGPRP, decreases the
amount or the duration of the effect of the biological or immunological activity of HGPRP.
Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules
which decrease the effect of HGPRP. 

The term "antibody" refers to intact molecules as well as to fragments thereof, such as
Fab, F(ab')₂, and Fv fragments, which are capable of binding the epitopic determinant. Antibodies
that bind HGPRP polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of
RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin,
and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope)
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (given regions or three-dimensional structures on
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen
used to elicit the immune response) for binding to an antibody.

The term "antisense" refers to any composition containing a nucleic acid sequence which 
is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules 
may be produced by any method including synthesis or transcription. Once introduced into a cell, 
the complementary nucleotides combine with natural sequences produced by the cell to form
duplexes and to block either transcription or translation. The designation "negative" can refer to 
the antisense strand, and the designation "positive" can refer to the sense strand. 

The term "biologically active" refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" 
refers to the capability of the natural, recombinant, or synthetic HGPRP, or of any oligopeptide
thereof, to induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The terms "complementary" or "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the
complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that
total complementarity exists between the single stranded molecules. The degree of
complementarity between nucleic acid strands has significant effects on the efficiency and strength
of the hybridization between the nucleic acid strands. This is of particular importance in
amplification reactions, which depend upon binding between nucleic acid strands, and in the
design and use of peptide nucleic acid (PNA) molecules.
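
The base-pairing example and the distinction between partial and complete complementarity can be sketched in a few lines. This is a hedged illustration, not part of the disclosure: it assumes DNA with only the bases A, C, G, and T, Watson-Crick pairing, and strands of equal length compared position by position; the function names are hypothetical.

```python
# Watson-Crick pairing for DNA (assumption: only A, C, G, T are present).
_PAIRING = str.maketrans("ACGT", "TGCA")


def complement_3to5(strand_5to3):
    """Complementary strand written 3'->5', aligned base-by-base with the input."""
    return strand_5to3.upper().translate(_PAIRING)


def reverse_complement(strand_5to3):
    """The same complementary strand rewritten in the 5'->3' direction."""
    return complement_3to5(strand_5to3)[::-1]


def fraction_paired(strand_5to3, strand_3to5):
    """Fraction of aligned positions that base-pair (1.0 means complete)."""
    if not strand_5to3 or len(strand_5to3) != len(strand_3to5):
        raise ValueError("strands must be non-empty and of equal length")
    expected = complement_3to5(strand_5to3)
    paired = sum(x == y for x, y in zip(expected, strand_3to5.upper()))
    return paired / len(expected)
```

With the example from the text, `complement_3to5("AGT")` gives `"TCA"`, and `fraction_paired("AGT", "TCA")` is 1.0, i.e. complete complementarity; any value between 0 and 1 corresponds to partial complementarity.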

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an
aqueous solution. Compositions comprising polynucleotide sequences encoding HGPRP or 
fragments of HGPRP may be employed as hybridization probes. The probes may be stored in 
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In 
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCl),
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution,
dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, Foster City CA) in the 5' 
and/or the 3' direction, and resequenced, or which has been assembled from the overlapping
sequences of more than one Incyte Clone using a computer program for fragment assembly, such
as the GELVIEW Fragment Assembly system (GCG, Madison WI). Some sequences have been
both extended and assembled to produce the consensus sequence. 

The term "correlates with expression of a polynucleotide" indicates that the detection of 
the presence of nucleic acids, the same or related to a nucleic acid sequence encoding HGPRP, by
northern analysis is indicative of the presence of nucleic acids encoding HGPRP in a sample, and
thereby correlates with expression of the transcript from the polynucleotide encoding HGPRP. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for
example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide 
encodes a polypeptide which retains at least one biological or immunological function of the 
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide
from which it was derived.

The term "similarity" refers to a degree of complementarity. There may be partial 
similarity or complete similarity. The word "identity" may substitute for the word "similarity". A 
partially complementary sequence that at least partially inhibits an identical sequence from 
hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of
hybridization of the completely complementary sequence to the target sequence may be examined 
using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under 
conditions of reduced stringency. A substantially similar sequence or hybridization probe will 
compete for and inhibit the binding of a completely similar (identical) sequence to the target
sequence under conditions of reduced stringency. This is not to say that conditions of reduced
stringency are such that non-specific binding is permitted, as reduced stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence which 
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity).
In the absence of non-specific binding, the substantially similar sequence or probe will not
hybridize to the second non-complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR,
Madison WI) which creates alignments between two or more sequences according to methods
selected by the user, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp (1988) 
Gene 73:237-244.) The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. The percentage 
similarity between two amino acid sequences, e.g., sequence A and sequence B, is calculated by
dividing the length of sequence A, minus the number of gap residues in sequence A, minus the
number of gap residues in sequence B, into the sum of the residue matches between sequence A 
and sequence B, times one hundred. Gaps of low or of no similarity between the two amino acid 
sequences are not included in determining percentage similarity. Percent identity between nucleic 
acid sequences can also be counted or calculated by other methods known in the art, e.g., the Jotun
Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 183:626-645.) Identity between
sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 
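
The percentage similarity calculation described above can be expressed as a short function. This is a sketch under stated assumptions rather than the MEGALIGN implementation: the two sequences are assumed to be already aligned to equal length with "-" marking gap residues, the "length of sequence A" is taken to be the length of its aligned row, and a match is a position where both rows carry the same residue. The function name and the example alignment are hypothetical.

```python
GAP = "-"  # assumption: gap residues in the alignment are written as "-"


def percent_similarity(aligned_a, aligned_b):
    """100 * matches / (len(A) - gaps in A - gaps in B) for two aligned rows."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    a, b = aligned_a.upper(), aligned_b.upper()
    gaps_a = a.count(GAP)
    gaps_b = b.count(GAP)
    # A match is an identical residue at the same column; gap columns never match.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != GAP)
    denominator = len(a) - gaps_a - gaps_b
    if denominator <= 0:
        raise ValueError("no comparable residues in the alignment")
    return 100.0 * matches / denominator


# Hypothetical alignment: 5 matches over 7 columns with one gap in row A.
example = percent_similarity("MKT-LLV", "MKTALIV")  # 5 / (7 - 1 - 0) * 100
```

The result can then be compared against a threshold such as the 90% amino acid identity or 70% polynucleotide identity recited in the summary.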

"Human artificial chromosomes" (HACs) are linear microchromosomes which may 
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements
required for stable mitotic chromosome segregation and maintenance.

The term "humanized antibody" refers to antibody molecules in which the amino acid
sequence in the non-antigen binding regions has been altered so that the antibody more closely
resembles a human antibody, and still retains its original binding ability.

"Hybridization" refers to any process by which a strand of nucleic acid binds with a
complementary strand through base pairing.

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a
solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" or "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
to the sequence found in the naturally occurring molecule.

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which 
may affect cellular and systemic defense systems.

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate.

The terms "element" or "array element," in a microarray context, refer to hybridizable
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of HGPRP. For example, 
modulation may cause an increase or a decrease in protein activity, binding characteristics, or any 
other biological, functional, or immunological properties of HGPRP.

The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to a 
nucleotide, oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to 
DNA or RNA of genomic or synthetic origin which may be single-stranded or double-stranded 
and may represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any 
DNA-like or RNA-like material. In this context, "fragments" refers to those nucleic acid
sequences which comprise a region of unique polynucleotide sequence that specifically identifies
SEQ ID NO:7-12, for example, as distinct from any other sequence in the same genome. For
example, a fragment of SEQ ID NO:7-12 is useful in hybridization and amplification technologies
and in analogous methods that distinguish SEQ ID NO:7-12 from related polynucleotide
sequences. A fragment of SEQ ID NO:7-12 is at least about 15-20 nucleotides in length. The
precise length of the fragment of SEQ ID NO:7-12 and the region of SEQ ID NO:7-12 to which 
the fragment corresponds are routinely determinable by one of ordinary skill in the art based on 
the intended purpose for the fragment. In some cases, a fragment, when translated, would produce 
polypeptides retaining some functional characteristic, e.g., antigenicity, or structural domain
characteristic, e.g., ATP-binding site, of the full-length polypeptide. 

The terms "operably associated" or "operably linked" refer to functionally related nucleic 
acid sequences. A promoter is operably associated or operably linked with a coding sequence if 
the promoter controls the expression of the encoded polypeptide. While operably associated or
operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain
genetic elements, e.g., repressor genes, are not contiguously linked to the sequence encoding the 
polypeptide but still bind to operator sequences that control expression of the polypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20 
to 25 nucleotides, which can be used in PCR amplification or in a hybridization assay or
microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer," 
"oligomer," and "probe," as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition.
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding HGPRP, or fragments thereof, or HGPRP itself, may comprise a bodily fluid; an
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic
DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" or "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, or an antagonist. The interaction is dependent upon 
the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the
presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a 
reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds 
to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between
polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt
concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
conditions well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
temperature.

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by
different amino acids or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells,
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

"Transformation" describes a process by which exogenous DNA enters and changes a 
recipient cell. Transformation may occur under natural or artificial conditions according to 
various methods well known in the art, and may rely on any known method for the insertion of 
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for
transformation is selected based on the type of host cell being transformed and may include, but is
not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. 
The term "transformed" cells includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for
limited periods of time.

A "variant" of HGPRP polypeptides refers to an amino acid sequence that is altered by 
one or more amino acid residues. The variant may have "conservative" changes, wherein a 
substituted amino acid has similar structural or chemical properties (e.g., replacement of leucine 
with isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement
of glycine with tryptophan). Analogous minor variations may also include amino acid deletions or
insertions, or both. Guidance in determining which amino acid residues may be substituted, 
inserted, or deleted without abolishing biological or immunological activity may be found using 
computer programs well known in the art, for example, LASERGENE software (DNASTAR).

The term "variant", when used in the context of a polynucleotide sequence, may
encompass a polynucleotide sequence related to HGPRP. This definition may also include, for
example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or 
lesser number of nucleotides due to alternate splicing of exons during mRNA processing. The
corresponding polypeptide may possess additional functional domains or an absence of domains.
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 
a disease state. 
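
To make the notion of a polymorphic variant that "varies by one base" concrete, the following Python sketch lists the positions at which two sequences of equal length differ. The function names and the restriction to equal-length, already aligned sequences are assumptions made for this illustration; it does not handle insertions or deletions.

```python
from typing import List, Tuple


def single_base_differences(reference: str, variant: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, reference base, variant base) for each mismatch."""
    if len(reference) != len(variant):
        raise ValueError("this sketch only compares sequences of equal length")
    return [
        (position, ref_base, var_base)
        for position, (ref_base, var_base) in enumerate(
            zip(reference.upper(), variant.upper()), start=1
        )
        if ref_base != var_base
    ]


def is_single_nucleotide_variant(reference: str, variant: str) -> bool:
    """True when the two sequences differ at exactly one base."""
    return len(single_base_differences(reference, variant)) == 1
```

For instance, comparing the hypothetical sequences ACGTAC and ACGGAC reports one difference at position 4 (T in the first sequence, G in the second).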

THE INVENTION

The invention is based on the discovery of new human GPCR proteins (HGPRP), the
polynucleotides encoding HGPRP, and the use of these compositions for the diagnosis, treatment,
or prevention of cell proliferative, neurological, and immune disorders. 

Table 1 lists the Incyte Clones used to derive full length nucleotide sequences encoding
HGPRP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NO) of the amino 
acid and nucleic acid sequences, respectively. Column 3 shows the Clone ID of the Incyte Clone
in which nucleic acids encoding each HGPRP were identified, and column 4, the cDNA libraries
from which these clones were isolated. Column 5 shows Incyte clones, their corresponding cDNA 
libraries, and shotgun sequences. The clones and shotgun sequences are part of the consensus 
nucleotide sequence of each HGPRP and are useful as fragments in hybridization technologies.

The columns of Table 2 show various properties of the polypeptides of the invention:
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each
polypeptide; column 3, potential phosphorylation sites; column 4, potential glycosylation sites; 
column 5, the amino acid residues comprising signature sequences and motifs; column 6, the 
identity of each protein; and column 7, analytical methods used to identify each protein through 
sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions
associated with nucleotide sequences encoding HGPRP. The first column of Table 3 lists the
polypeptide sequence identifiers. The second column lists tissue categories which express HGPRP 
as a fraction of total tissue categories expressing HGPRP. The third column lists the diseases, 
disorders, or conditions associated with those tissues expressing HGPRP. The fourth column lists
the vectors used to subclone the cDNA library.

The following fragments of the nucleotide sequences encoding HGPRP are useful in 
hybridization or amplification technologies to identify SEQ ID NO:7-12 and to distinguish 
between SEQ ID NO:7-12 and related polynucleotide sequences. The useful fragments are the 
fragment of SEQ ID NO:7 from about nucleotide 235 to about nucleotide 270; the fragment of
SEQ ID NO:8 from about nucleotide 218 to about nucleotide 247; the fragment of SEQ ID NO:9
from about nucleotide 271 to about nucleotide 300; the fragment of SEQ ID NO:10 from about
nucleotide 273 to about nucleotide 303; the fragment of SEQ ID NO:11 from about nucleotide 542
to about nucleotide 571; and the fragment of SEQ ID NO:12 from about nucleotide 703 to about
nucleotide 735.
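
The fragment coordinates listed above can be captured in a small lookup table. The following Python sketch is illustrative only: it treats the "about" positions as exact 1-based inclusive coordinates, and the names USEFUL_FRAGMENTS, extract_fragment, and fragment_length are hypothetical. The sequences themselves are not reproduced here, so the caller must supply them.

```python
# Approximate fragment coordinates from the text, as 1-based inclusive
# (start, end) nucleotide positions keyed by SEQ ID NO.
USEFUL_FRAGMENTS = {
    7: (235, 270),
    8: (218, 247),
    9: (271, 300),
    10: (273, 303),
    11: (542, 571),
    12: (703, 735),
}


def fragment_length(seq_id_no: int) -> int:
    """Length in nucleotides of the listed fragment for a given SEQ ID NO."""
    start, end = USEFUL_FRAGMENTS[seq_id_no]
    return end - start + 1


def extract_fragment(sequence: str, seq_id_no: int) -> str:
    """Slice the listed fragment out of a caller-supplied nucleotide sequence."""
    start, end = USEFUL_FRAGMENTS[seq_id_no]
    if end > len(sequence):
        raise ValueError("sequence is shorter than the fragment end position")
    # Convert 1-based inclusive coordinates to a 0-based Python slice.
    return sequence[start - 1:end]
```

Under this reading each listed fragment is 30 to 36 nucleotides long, which is consistent with the probe and primer lengths discussed elsewhere in the specification.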

The invention also encompasses HGPRP variants. A preferred HGPRP variant is one 
which has at least about 80%, more preferably at least about 90%, and most preferably at least 
about 95% amino acid sequence identity to the HGPRP amino acid sequence, and which contains 
at least one functional or structural characteristic of HGPRP. 

The invention also encompasses polynucleotides which encode HGPRP. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence
selected from the group consisting of SEQ ID NO:7-12, which encodes HGPRP. 

The invention also encompasses a variant of a polynucleotide sequence encoding HGPRP. 
In particular, such a variant polynucleotide sequence will have at least about 70%, more preferably
at least about 85%, and most preferably at least about 95% polynucleotide sequence identity to the
polynucleotide sequence encoding HGPRP. A particular aspect of the invention encompasses a 
variant of a polynucleotide sequence comprising a sequence selected from the group consisting of 
SEQ ID NO:7-12 which has at least about 70%, more preferably at least about 85%, and most
preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected
from the group consisting of SEQ ID NO:7-12. Any one of the polynucleotide variants described
above can encode an amino acid sequence which contains at least one functional or structural 
characteristic of HGPRP. 
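
The identity thresholds stated for polypeptide and polynucleotide variants can be summarized as tiers. The following Python sketch is a reading aid under stated assumptions: the "at least about" thresholds are treated as exact cutoffs, and the names POLYNUCLEOTIDE_TIERS, POLYPEPTIDE_TIERS, and identity_tier are hypothetical. Percent identity alone does not establish that a sequence is a variant, since the text also requires at least one functional or structural characteristic of HGPRP.

```python
from typing import Optional, Sequence, Tuple

Tier = Tuple[float, str]

# (minimum percent identity, label), ordered from highest to lowest threshold.
POLYNUCLEOTIDE_TIERS: Sequence[Tier] = (
    (95.0, "most preferred"),
    (85.0, "more preferred"),
    (70.0, "preferred"),
)
POLYPEPTIDE_TIERS: Sequence[Tier] = (
    (95.0, "most preferred"),
    (90.0, "more preferred"),
    (80.0, "preferred"),
)


def identity_tier(percent_identity: float, tiers: Sequence[Tier]) -> Optional[str]:
    """Return the label of the highest tier met, or None if below all thresholds."""
    for minimum, label in tiers:
        if percent_identity >= minimum:
            return label
    return None
```

A sequence with 72% identity would therefore fall in the lowest polynucleotide tier but below every polypeptide tier.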

It will be appreciated by those skilled in the art that, as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding HGPRP, some bearing minimal
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring HGPRP, and all such variations are to be
considered as being specifically disclosed.
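
The size of the "multitude" that follows from codon degeneracy can be estimated by multiplying, residue by residue, the number of codons that encode each amino acid in the standard genetic code. The following Python sketch does this for a short peptide given in one-letter code. The function name is hypothetical, and stop codons are deliberately left out of the count.

```python
from math import prod

# Number of codons encoding each amino acid in the standard genetic code
# (one-letter amino acid codes; the 61 sense codons in total).
CODONS_PER_AMINO_ACID = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2,
    "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4,
    "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct nucleotide sequences that encode the given peptide."""
    return prod(CODONS_PER_AMINO_ACID[residue] for residue in peptide.upper())
```

Even the three-residue peptide Leu-Ser-Arg has 6 x 6 x 6 = 216 possible coding sequences, so the count for a full-length polypeptide is very large.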

Although nucleotide sequences which encode HGPRP and its variants are preferably 
capable of hybridizing to the nucleotide sequence of the naturally occurring HGPRP under 
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide
sequences encoding HGPRP or its derivatives possessing a substantially different codon usage,
e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at
which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance 
with the frequency with which particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding HGPRP and its derivatives without 
altering the encoded amino acid sequences include the production of RNA transcripts having more
desirable properties, such as a greater half-life, than transcripts produced from the naturally 
occurring sequence. 
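
Selecting codons according to host usage can be sketched as a back-translation that picks, for each amino acid, the codon the host uses most often. In the following Python example the usage table is a placeholder: the codon assignments follow the standard genetic code, but the frequencies are invented for illustration and are not measured values for any organism. The names HYPOTHETICAL_HOST_USAGE and back_translate are likewise assumptions.

```python
from typing import Dict

# PLACEHOLDER relative codon frequencies for a hypothetical host.
# These numbers are illustrative only; substitute a measured usage table.
HYPOTHETICAL_HOST_USAGE: Dict[str, Dict[str, float]] = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.70, "AAG": 0.30},
    "L": {"CTG": 0.50, "TTA": 0.15, "TTG": 0.15,
          "CTT": 0.10, "CTC": 0.05, "CTA": 0.05},
    "G": {"GGC": 0.40, "GGT": 0.35, "GGA": 0.15, "GGG": 0.10},
}


def back_translate(peptide: str, usage: Dict[str, Dict[str, float]]) -> str:
    """Encode a peptide using the most frequent codon for each residue."""
    codons = []
    for residue in peptide.upper():
        if residue not in usage:
            raise KeyError(f"no codon usage data for residue {residue!r}")
        table = usage[residue]
        # Pick the codon with the highest relative frequency in this host.
        codons.append(max(table, key=table.get))
    return "".join(codons)
```

Always choosing the single most frequent codon is the simplest possible policy. It changes the nucleotide sequence without altering the encoded amino acid sequence, which is the property the paragraph above relies on.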

The invention also encompasses production of DNA sequences which encode HGPRP and 
HGPRP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the
synthetic sequence may be inserted into any of the many available expression vectors and cell
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding HGPRP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
NO:7-12 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M.
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) For example, stringent salt concentration will ordinarily be less than about 750 mM
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium
citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low
stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while
high stringency hybridization can be obtained in the presence of at least about 35% formamide,
and most preferably at least about 50% formamide. Stringent temperature conditions will
ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and
most preferably of at least about 42°C. Varying additional parameters, such as hybridization time,
the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are 
accomplished by combining these various conditions as needed. In a preferred embodiment, 
hybridization will occur at 30°C in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a
more preferred embodiment, hybridization will occur at 37°C in 500 mM NaCl, 50 mM trisodium
citrate, 1% SDS, 35% formamide, and 100 μg/ml denatured salmon sperm DNA (ssDNA). In a
most preferred embodiment, hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium
citrate, 1% SDS, 50% formamide, and 200 μg/ml ssDNA. Useful variations on these conditions
will be readily apparent to those skilled in the art.

The washing steps which follow hybridization can also vary in stringency. Wash
stringency conditions can be defined by salt concentration and by temperature. As above, wash
stringency can be increased by decreasing salt concentration or by increasing temperature. For
example, stringent salt concentration for the wash steps will preferably be less than about 30 mM
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM
trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include
temperatures of at least about 25°C, more preferably of at least about 42°C, and most preferably of
at least about 68°C. In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3
mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at
42°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment,
wash steps will occur at 68°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS.
Additional variations on these conditions will be readily apparent to those skilled in the art. 
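
The salt concentrations given for hybridization and washing all keep NaCl and trisodium citrate in a 10:1 molar ratio, which is the ratio of saline-sodium citrate (SSC) buffer. The following Python sketch converts the stated concentrations to SSC multiples, assuming the conventional definition of 1x SSC as 150 mM NaCl with 15 mM trisodium citrate. The specification does not itself use SSC notation, and the names below are hypothetical.

```python
# Conventional 1x SSC composition (assumption: not stated in the text).
NACL_MM_PER_1X_SSC = 150.0
CITRATE_MM_PER_1X_SSC = 15.0

# (label, temperature in degrees C, NaCl in mM, trisodium citrate in mM)
HYBRIDIZATION_EMBODIMENTS = [
    ("preferred", 30, 750.0, 75.0),
    ("more preferred", 37, 500.0, 50.0),
    ("most preferred", 42, 250.0, 25.0),
]
WASH_EMBODIMENTS = [
    ("preferred", 25, 30.0, 3.0),
    ("more preferred", 42, 15.0, 1.5),
    ("most preferred", 68, 15.0, 1.5),
]


def ssc_multiple(nacl_mm: float, citrate_mm: float) -> float:
    """Express a NaCl/citrate pair as a multiple of 1x SSC."""
    nacl_x = nacl_mm / NACL_MM_PER_1X_SSC
    citrate_x = citrate_mm / CITRATE_MM_PER_1X_SSC
    # Both components must scale together for the buffer to be a dilution of SSC.
    if abs(nacl_x - citrate_x) > 1e-9:
        raise ValueError("NaCl and citrate are not in the SSC ratio")
    return nacl_x


for label, temp_c, nacl, citrate in HYBRIDIZATION_EMBODIMENTS + WASH_EMBODIMENTS:
    print(f"{label:>14} at {temp_c} C: {ssc_multiple(nacl, citrate):.2f}x SSC")
```

On this convention the hybridization embodiments run from 5x SSC down to roughly 1.67x SSC, and the wash embodiments use 0.2x and 0.1x SSC, so salt decreases as the stated stringency increases.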

Methods for DNA sequencing are well known in the art and may be used to practice any 
of the embodiments of the invention. The methods may employ such enzymes as the Klenow 
fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq DNA
polymerase (PE Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech,
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those
found in the ELONGASE amplification system (Life Technologies, Gaithersburg MD).
Preferably, sequence preparation is automated with machines such as the MICROLAB 2200 
system (Hamilton, Reno NV), DNA ENGINE thermal cycler (PTC200; MJ Research, Watertown
MA) and the ABI CATALYST 800 (PE Biosystems). Sequencing is then carried out using either
ABI PRISM 373 or 377 DNA sequencing systems (PE Biosystems) or the MEGABACE 1000 
DNA sequencing system (Amersham Pharmacia Biotech). The resulting sequences are analyzed 
using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) 
Short Protocols in Molecular Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A.
(1995) Molecular Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.)

The nucleic acid sequences encoding HGPRP may be extended utilizing a partial 
nucleotide sequence and employing various PCR-based methods known in the art to detect 
upstream sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown
sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods
Applic. 2:318-322.) Another method, inverse PCR, uses primers that extend in divergent
directions to amplify unknown sequence from a circularized template. The template is derived 
from restriction fragments comprising a known genomic locus and surrounding sequences. (See, 
e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves
PCR amplification of DNA fragments adjacent to known sequences in human and yeast artificial
chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-119.) In
this method, multiple restriction enzyme digestions and ligations may be used to insert an 
engineered double-stranded sequence into a region of unknown sequence before performing PCR. 
Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g.,
Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060.) Additionally, one may use PCR,
nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic 
DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For all PCR-based methods, primers may be designed using commercially available 
software, such as OLIGO 4.06 primer analysis software (National Biosciences, Plymouth MN) or
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C. 
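
Two of the primer design criteria stated above, length and GC content, are simple to check programmatically. The following Python sketch is not the OLIGO software and does not estimate annealing temperature, which depends on a thermodynamic model the text does not specify. The function names and the treatment of the "about" limits as exact cutoffs are assumptions.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of bases in the primer that are G or C."""
    primer = primer.upper()
    if not primer:
        raise ValueError("primer must not be empty")
    return sum(base in "GC" for base in primer) / len(primer)


def meets_primer_guidelines(
    primer: str,
    min_length: int = 22,
    max_length: int = 30,
    min_gc: float = 0.50,
) -> bool:
    """Check length (22 to 30 nt) and GC content (50% or more) only."""
    length_ok = min_length <= len(primer) <= max_length
    return length_ok and gc_fraction(primer) >= min_gc
```

A primer that passes this check would still need its annealing temperature evaluated against the 68°C to 72°C range by an appropriate program.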

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to 
analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
capillary sequencing may employ flowable polymers for electrophoretic separation, four different
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for 
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal 
using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR software, PE 
Biosystems), and the entire process from loading of samples to computer analysis and electronic 
data display may be computer controlled. Capillary electrophoresis is especially preferable for
sequencing small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode HGPRP may be cloned in recombinant DNA molecules that direct expression of 
HGPRP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the
same or a functionally equivalent amino acid sequence may be produced and used to express 
HGPRP. 

The nucleotide sequences of the present invention can be engineered using methods 
generally known in the art in order to alter HGPRP-encoding sequences for a variety of purposes
including, but not limited to, modification of the cloning, processing, and/or expression of the
gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments
and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create
new restriction sites, alter glycosylation patterns, change codon preference, produce splice
variants, and so forth. 

In another embodiment, sequences encoding HGPRP may be synthesized, in whole or in 
part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980)
Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.)
Alternatively, HGPRP itself or a fragment thereof may be synthesized using chemical methods.
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using 
the ABI 431A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of
HGPRP, or any part thereof, may be altered during direct synthesis and/or combined with
sequences from other proteins, or any part thereof, to produce a variant polypeptide.

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by
sequencing. (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH
Freeman, New York NY.)

In order to express a biologically active HGPRP, the nucleotide sequences encoding 
HGPRP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for transcriptional and translational control of the inserted 
coding sequence in a suitable host. These elements include regulatory sequences, such as
enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector
and in polynucleotide sequences encoding HGPRP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding HGPRP. Such signals include the ATG initiation codon and adjacent 
sequences, e.g., the Kozak sequence. In cases where sequences encoding HGPRP and its initiation
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no
additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic.
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 
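
The role of the sequence context around the ATG initiation codon can be illustrated with a simple check. The specification does not say how a Kozak sequence is evaluated. The following Python sketch uses a commonly cited simplification, a purine at position -3 and a G at position +4 relative to the A of the ATG (numbered +1), and that rule together with the labels "strong", "adequate", and "weak" is an assumption of this example.

```python
def kozak_strength(sequence: str, atg_index: int) -> str:
    """Classify the context of the ATG that starts at 0-based atg_index.

    Simplified rule (assumption): 'strong' if position -3 is a purine and
    position +4 is G, 'adequate' if only one holds, otherwise 'weak'.
    """
    seq = sequence.upper().replace("U", "T")
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")

    # With the A of ATG as +1, position -3 is three bases upstream and
    # position +4 is the base immediately after the ATG.
    minus3 = seq[atg_index - 3] if atg_index >= 3 else ""
    plus4 = seq[atg_index + 3] if atg_index + 3 < len(seq) else ""

    purine_at_minus3 = minus3 in ("A", "G")
    g_at_plus4 = plus4 == "G"

    if purine_at_minus3 and g_at_plus4:
        return "strong"
    if purine_at_minus3 or g_at_plus4:
        return "adequate"
    return "weak"
```

For a construct in which only the coding sequence is inserted, a check of this kind could be applied to the vector-supplied ATG to see whether its context is likely to support efficient initiation.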

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding HGPRP and appropriate transcriptional and
translational control elements. These methods include in vitro recombinant DNA techniques,
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989)
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons,
New York NY, ch. 9, 13, and 16.)

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding HGPRP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral 
expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression
vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not limited by the 
host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected 
depending upon the use intended for polynucleotide sequences encoding HGPRP. For example,
routine cloning, subcloning, and propagation of polynucleotide sequences encoding HGPRP can 
be achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla 
CA) or PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding HGPRP into the 
vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure 
for identification of transformed bacteria containing recombinant molecules. In addition, these
vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 
and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of HGPRP are 
needed, e.g., for the production of antibodies, vectors which direct high level expression of HGPRP
may be used. For example, vectors containing the strong, inducible T5 or T7 bacteriophage
promoter may be used. 

Yeast expression systems may be used for production of HGPRP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, 
may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such vectors
direct either the secretion or intracellular retention of expressed proteins and enable integration of
foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of HGPRP. Transcription of sequences
encoding HGPRP may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987)
EMBO J. 6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680;
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell
Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA
transformation or pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of
Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding HGPRP may be ligated
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite
leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to
obtain infective virus which expresses HGPRP in host cells. (See, e.g., Logan, J. and T. Shenk 
(1984) Proc. Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as 

25 the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host 
cells. SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb
are constructed and delivered via conventional delivery methods (liposomes, polycationic amino
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat.
Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 
expression of HGPRP in cell lines is preferred. For example, sequences encoding HGPRP can be 
transformed into cell lines using expression vectors which may contain viral origins of replication
and/or endogenous expression elements and a selectable marker gene on the same or on a separate
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2
days in enriched media before being switched to selective media. The purpose of the selectable
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of
cells which successfully express the introduced sequences. Resistant clones of stably transformed
cells may be propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk⁻ or apr⁻ cells, respectively. (See, e.g., Wigler, M. et
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite,
antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers
resistance to methotrexate; neo confers resistance to the aminoglycosides, neomycin and G-418;
and als and pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase,
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570;
Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) Additional selectable genes have been
described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g.,
Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:8047-8051.) Visible
markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), β glucuronidase and its
substrate β-glucuronide, or luciferase and its substrate luciferin may be used. These markers can
be used not only to identify transformants, but also to quantify the amount of transient or stable
protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) 
Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. For
example, if the sequence encoding HGPRP is inserted within a marker gene sequence, transformed
cells containing sequences encoding HGPRP can be identified by the absence of marker gene 
function. Alternatively, a marker gene can be placed in tandem with a sequence encoding HGPRP 
under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding HGPRP and that
express HGPRP may be identified by a variety of procedures known to those of skill in the art.
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane,
solution, or chip based technologies for the detection and/or quantification of nucleic acid or
protein sequences.

Immunological methods for detecting and measuring the expression of HGPRP using 
either specific polyclonal or monoclonal antibodies are known in the art. Examples of such 
techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), 
and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay
utilizing monoclonal antibodies reactive to two non-interfering epitopes on HGPRP is preferred,
but a competitive binding assay may be employed. These and other assays are well known in the
art. (See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press,
St. Paul MN, Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub.
Associates and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical
Protocols, Humana Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding HGPRP
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled
nucleotide. Alternatively, the sequences encoding HGPRP, or any fragments thereof, may be
cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are
commercially available, and may be used to synthesize RNA probes in vitro by addition of an
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures
may be conducted using a variety of commercially available kits, such as those provided by
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HGPRP may be cultured
under conditions suitable for the expression and recovery of the protein from cell culture. The
protein produced by a transformed cell may be secreted or retained intracellularly depending on
the sequence and/or the vector used. As will be understood by those of skill in the art, expression
vectors containing polynucleotides which encode HGPRP may be designed to contain signal
sequences which direct secretion of HGPRP through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro"
form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from
the American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the
correct modification and processing of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding HGPRP may be ligated to a heterologous sequence resulting in translation of 
a fusion protein in any of the aforementioned host systems. For example, a chimeric HGPRP 
protein containing a heterologous moiety that can be recognized by a commercially available
antibody may facilitate the screening of peptide libraries for inhibitors of HGPRP activity.
Heterologous protein and peptide moieties may also facilitate purification of fusion proteins using
commercially available affinity matrices. Such moieties include, but are not limited to, glutathione
S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding
peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His
enable purification of their cognate fusion proteins on immobilized glutathione, maltose,
phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and
hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using commercially
available monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A
fusion protein may also be engineered to contain a proteolytic cleavage site located between the
HGPRP encoding sequence and the heterologous protein sequence, so that HGPRP may be
cleaved away from the heterologous moiety following purification. Methods for fusion protein
expression and purification are discussed in Ausubel (1995, supra, ch. 10). A variety of
commercially available kits may also be used to facilitate expression and purification of fusion 
proteins. 
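
The tag-to-matrix pairings listed above can be written as a simple lookup. The following is a minimal sketch only; the names AFFINITY_MATRICES, IMMUNOAFFINITY_TAGS, and purification_route are hypothetical and introduced purely for illustration, while the pairings themselves are taken from the preceding paragraph.

```python
# Pairings as stated in the text; the identifiers are illustrative only.
AFFINITY_MATRICES = {
    "GST": "immobilized glutathione",
    "MBP": "maltose",
    "Trx": "phenylarsine oxide",
    "CBP": "calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified with tag-specific antibodies.
IMMUNOAFFINITY_TAGS = ("FLAG", "c-myc", "HA")


def purification_route(tag):
    """Return the purification approach the text associates with a fusion tag."""
    if tag in AFFINITY_MATRICES:
        return "affinity matrix: " + AFFINITY_MATRICES[tag]
    if tag in IMMUNOAFFINITY_TAGS:
        return "immunoaffinity: anti-" + tag + " antibody"
    raise KeyError("tag not listed in the text: " + tag)


if __name__ == "__main__":
    for tag in ("GST", "6-His", "FLAG"):
        print(tag, "->", purification_route(tag))
```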

In a further embodiment of the invention, synthesis of radiolabeled HGPRP may be
achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems
(Promega). These systems couple transcription and translation of protein-coding sequences
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of
a radiolabeled amino acid precursor, preferably ³⁵S-methionine.

Fragments of HGPRP may be produced not only by recombinant production, but also by
direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.)
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis
may be achieved, for example, using the ABI 431A peptide synthesizer (PE Biosystems). Various
fragments of HGPRP may be synthesized separately and then combined to produce the full length
molecule.

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of HGPRP and GPCR proteins. In addition, the expression of HGPRP is closely 
associated with cell proliferative and immune disorders, and with neurological tissues. Therefore,
HGPRP appears to play a role in cell proliferative, neurological, and immune disorders. In the
treatment of disorders associated with increased HGPRP expression or activity, it is desirable to
decrease the expression or activity of HGPRP. In the treatment of disorders associated with
decreased HGPRP expression or activity, it is desirable to increase the expression or activity of
HGPRP.

Therefore, in one embodiment, HGPRP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HGPRP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis,
mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria,
polycythemia vera, psoriasis, primary thrombocythemia; cancers including adenocarcinoma,
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers
of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia,
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis,
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an immune disorder such
as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress
syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis,
autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema,
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum,
atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease,
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis,
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis,
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma,
Sjögren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer,
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and
helminthic infections, and trauma; and a neurological disorder such as epilepsy, ischemic
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease,
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias; multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system
disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Sträussler-Scheinker
syndrome; fatal familial insomnia, nutritional and metabolic diseases of the
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the
central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis;
inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic paralysis;
mental disorders including mood, anxiety, and schizophrenic disorders; akathisia, amnesia,
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic
neuralgia, and Tourette's disorder. 

In another embodiment, a vector capable of expressing HGPRP or a fragment or 
derivative thereof may be administered to a subject to treat or prevent a disorder associated with 
decreased expression or activity of HGPRP including, but not limited to, those described above.

In a further embodiment, a pharmaceutical composition comprising a substantially
purified HGPRP in conjunction with a suitable pharmaceutical carrier may be administered to a
subject to treat or prevent a disorder associated with decreased expression or activity of HGPRP 
including, but not limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of HGPRP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of HGPRP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of HGPRP may be administered to a subject to 
treat or prevent a disorder associated with increased expression or activity of HGPRP. Examples of
such disorders include, but are not limited to, those described above. In one aspect, an antibody
which specifically binds HGPRP may be used directly as an antagonist or indirectly as a targeting
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express 
HGPRP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding HGPRP may be administered to a subject to treat or prevent a disorder associated
with increased expression or activity of HGPRP including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical
principles. The combination of therapeutic agents may act synergistically to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

An antagonist of HGPRP may be produced using methods which are generally known in
the art. In particular, purified HGPRP may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind HGPRP. Antibodies to HGPRP
may also be generated using methods that are well known in the art. Such antibodies may include,
but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab
fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (e.g.,
those which inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with HGPRP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, various
adjuvants may be used to increase immunological response. Such adjuvants include, but are not
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol.
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum
are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
HGPRP have an amino acid sequence consisting of at least about 5 amino acids, and, more
preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides,
or fragments are identical to a portion of the amino acid sequence of the natural protein and
contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of
HGPRP amino acids may be fused with those of another protein, such as KLH, and antibodies to
the chimeric molecule may be produced. 

Monoclonal antibodies to HGPRP may be prepared using any technique which provides 
for the production of antibody molecules by continuous cell lines in culture. These include, but 
are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the
EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al.
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984)
Proc. Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and
Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the
production of single chain antibodies may be adapted, using methods known in the art, to produce
HGPRP-specific single chain antibodies. Antibodies with related specificity, but of distinct
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134- 
10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte
population or by screening immunoglobulin libraries or panels of highly specific binding reagents
as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for HGPRP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments
produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity. (See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the
desired specificity. Numerous protocols for competitive binding or immunoradiometric assays
using either polyclonal or monoclonal antibodies with established specificities are well known in
the art. Such immunoassays typically involve the measurement of complex formation between
HGPRP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing
monoclonal antibodies reactive to two non-interfering HGPRP epitopes is preferred, but a
competitive binding assay may also be employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay
techniques may be used to assess the affinity of antibodies for HGPRP. Affinity is expressed as
an association constant, Kₐ, which is defined as the molar concentration of HGPRP-antibody
complex divided by the molar concentrations of free antigen and free antibody under equilibrium
conditions. The Kₐ determined for a preparation of polyclonal antibodies, which are
heterogeneous in their affinities for multiple HGPRP epitopes, represents the average affinity, or
avidity, of the antibodies for HGPRP. The Kₐ determined for a preparation of monoclonal
antibodies, which are monospecific for a particular HGPRP epitope, represents a true measure of
affinity. High-affinity antibody preparations with Kₐ ranging from about 10⁹ to 10¹² L/mole are
preferred for use in immunoassays in which the HGPRP-antibody complex must withstand
rigorous manipulations. Low-affinity antibody preparations with Kₐ ranging from about 10⁶ to 10⁷
L/mole are preferred for use in immunopurification and similar procedures which ultimately
require dissociation of HGPRP, preferably in active form, from the antibody (Catty, D. (1988)
Antibodies, Volume 1: A Practical Approach, IRL Press, Washington, DC; Liddell, J.E. and Cryer,
A. (1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY).
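
The definition of the association constant given above can be worked through numerically. The sketch below is illustrative only: the function names and the example concentrations are assumptions made for this example, and the two ranges are simply the approximate bounds stated in the preceding paragraph.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Ka = [antigen-antibody complex] / ([free antigen] * [free antibody]).

    All concentrations are molar and taken at equilibrium; the result is in L/mole.
    """
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def suggested_use(ka):
    """Map Ka onto the two approximate ranges named in the text."""
    if 1e9 <= ka <= 1e12:
        return "immunoassay"
    if 1e6 <= ka <= 1e7:
        return "immunopurification"
    return "outside the ranges given in the text"


if __name__ == "__main__":
    # Hypothetical equilibrium concentrations, chosen only for illustration.
    ka = association_constant(2e-9, 1e-9, 1e-9)
    print(ka, suggested_use(ka))
```

Values falling between the two ranges are deliberately left unclassified, since the text gives no preference for them.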

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is preferred for use in procedures requiring precipitation
of HGPRP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity,
and guidelines for antibody quality and usage in various applications, are generally available. 
(See, e.g., Catty, supra , and Coligan et al. supra.) 

In another embodiment of the invention, the polynucleotides encoding HGPRP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the
complement of the polynucleotide encoding HGPRP may be used in situations in which it would
be desirable to block the transcription of the mRNA. In particular, cells may be transformed with
sequences complementary to polynucleotides encoding HGPRP. Thus, complementary molecules
or fragments may be used to modulate HGPRP activity, or to achieve regulation of gene function.
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger
fragments can be designed from various locations along the coding or control regions of sequences
encoding HGPRP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, 
or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art 
can be used to construct vectors to express nucleic acid sequences complementary to the
polynucleotides encoding HGPRP. (See, e.g., Sambrook, supra; Ausubel, 1995, supra.)

Genes encoding HGPRP can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of a polynucleotide, or fragment thereof, encoding HGPRP. 
Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. 
Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA
molecules until they are disabled by endogenous nucleases. Transient expression may last for a
month or more with a non-replicating vector, and may last even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or
regulatory regions of the gene encoding HGPRP. Oligonucleotides derived from the transcription
initiation site, e.g., between about positions -10 and +10 from the start site, are preferred.
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently
for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic
advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al.
(1994) in Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing,
Mt. Kisco NY, pp. 163-177.) A complementary sequence or antisense molecule may also be
designed to block translation of mRNA by preventing the transcript from binding to ribosomes.

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage
of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding HGPRP.

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: 
GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may 
be evaluated for secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization
with complementary oligonucleotides using ribonuclease protection assays. 
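
The initial scanning step described above can be sketched in a few lines. This is an illustration under stated assumptions, not a described method of the invention: the function names, the default window length of 17, and the example sequence are all hypothetical, and the sketch covers only the search for GUA, GUU, and GUC and the extraction of a 15 to 20 ribonucleotide stretch around each site. Secondary-structure evaluation and ribonuclease protection assays are outside its scope.

```python
# Triplets named in the text as ribozyme cleavage sites.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def normalize_rna(sequence):
    """Upper-case the sequence and convert DNA-style T to U."""
    return sequence.upper().replace("T", "U")


def find_cleavage_sites(sequence):
    """Return (index, triplet) for every GUA, GUU, or GUC in the target."""
    rna = normalize_rna(sequence)
    return [
        (i, rna[i:i + 3])
        for i in range(len(rna) - 2)
        if rna[i:i + 3] in CLEAVAGE_TRIPLETS
    ]


def candidate_window(sequence, site_index, length=17):
    """Return a stretch of 15 to 20 nt of the target that contains the triplet.

    If the target is shorter than `length`, the whole target is returned.
    """
    if not 15 <= length <= 20:
        raise ValueError("length must be between 15 and 20 ribonucleotides")
    rna = normalize_rna(sequence)
    # Roughly centre the window on the triplet, then clamp it to the sequence.
    start = max(0, site_index + 1 - length // 2)
    start = min(start, max(0, len(rna) - length))
    return rna[start:start + length]


if __name__ == "__main__":
    target = "AAAGUACCCGUCGGGAUUGUUACGCAUCGGA"  # made-up sequence for illustration
    for index, triplet in find_cleavage_sites(target):
        print(index, triplet, candidate_window(target, index))
```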

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro
and in vivo transcription of DNA sequences encoding HGPRP. Such DNA sequences may be 
incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or 
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues.

RNA molecules may be modified to increase intracellular stability and half-life. Possible
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3'
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs
and can be extended in all of these molecules by the inclusion of nontraditional bases such as
inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into
stem cells taken from the patient and clonally propagated for autologous transplant back into that
same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers 
may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. 
(1997) Nat. Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of
such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits,
monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 
pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier,
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist
of HGPRP, antibodies to HGPRP, and mimetics, agonists, antagonists, or inhibitors of HGPRP.
The compositions may be administered alone or in combination with at least one other agent, such
as a stabilizing compound, which may be administered in any sterile, biocompatible
pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water.
The compositions may be administered to a patient alone, or in combination with other agents,
drugs, or hormones.

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

30 In addition to the active ingredients, these pharmaceutical compositions may contain 

suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Further details on techniques for formulation and administration may be found 
in the latest edition of Remington's Pharmaceutical Sciences (Mack Publishing, Easton PA).

Pharmaceutical compositions for oral administration can be formulated using
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active
compounds with solid excipient and processing the resultant mixture of granules (optionally, after
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as
methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums,
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated
sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel,
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol.
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution,
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be
prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include
fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or 
liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the 
suspension may also contain suitable stabilizers or agents to increase the solubility of the 
compounds and allow for the preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art.

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.

The pharmaceutical composition may be provided as a salt and can be formed with many
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and
succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2%
sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to
use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of 
HGPRP, such labeling would include amount, frequency, and method of administration.

Pharmaceutical compositions suitable for use in the invention include compositions
wherein the active ingredients are contained in an effective amount to achieve the intended
purpose. The determination of an effective dose is well within the capability of those skilled in the 
art. 

For any compound, the therapeutically effective dose can be estimated initially either in
cell culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or
pigs. An animal model may also be used to determine the appropriate concentration range and
route of administration. Such information can then be used to determine useful doses and routes
for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example
HGPRP or fragments thereof, antibodies to HGPRP, and agonists, antagonists or inhibitors of
HGPRP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be
determined by standard pharmaceutical procedures in cell cultures or with experimental animals,
such as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or
LD50 (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic
effects is the therapeutic index, and it can be expressed as the LD50/ED50 ratio. Pharmaceutical
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell 
culture assays and animal studies are used to formulate a range of dosage for human use. The 
dosage contained in such compositions is preferably within a range of circulating concentrations 
that includes the ED50 with little or no toxicity. The dosage varies within this range depending
upon the dosage form employed, the sensitivity of the patient, and the route of administration.
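
By way of illustration only, the ratio described above can be written as a short Python sketch.
The function name and the dose values are hypothetical and are not data from this application;
the sketch merely assumes that both doses are expressed in the same units.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the LD50/ED50 ratio; both doses must be in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, chosen only to show the arithmetic.
print(therapeutic_index(ld50=500.0, ed50=25.0))  # 20.0
```

A larger ratio corresponds to a wider margin between the effective dose and the lethal dose.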

The exact dosage will be determined by the practitioner, in light of factors related to the
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, the general health of the subject, the age, weight, and 
gender of the subject, time and frequency of administration, drug combination(s), reaction
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance 
rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 

DIAGNOSTICS

In another embodiment, antibodies which specifically bind HGPRP may be used for the 
diagnosis of disorders characterized by expression of HGPRP, or in assays to monitor patients 
being treated with HGPRP or agonists, antagonists, or inhibitors of HGPRP. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics.
Diagnostic assays for HGPRP include methods which utilize the antibody and a label to detect
HGPRP in human body fluids or in extracts of cells or tissues. The antibodies may be used with 
or without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known 
in the art and may be used. 

A variety of protocols for measuring HGPRP, including ELISAs, RIAs, and FACS, are
known in the art and provide a basis for diagnosing altered or abnormal levels of HGPRP
expression. Normal or standard values for HGPRP expression are established by combining body 
fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to 
HGPRP under conditions suitable for complex formation. The amount of standard complex
formation may be quantitated by various methods, preferably by photometric means. Quantities of
HGPRP expressed in subject, control, and disease samples from biopsied tissues are compared 
with the standard values. Deviation between standard and subject values establishes the 
parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HGPRP may be
used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide
sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be
used to detect and quantitate gene expression in biopsied tissues in which expression of HGPRP 
may be correlated with disease. The diagnostic assay may be used to determine absence, 
presence, and excess expression of HGPRP, and to monitor regulation of HGPRP levels during 
therapeutic intervention.

In one aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding HGPRP or closely related 
molecules may be used to identify nucleic acid sequences which encode HGPRP. The specificity 
of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or
from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies
only naturally occurring sequences encoding HGPRP, allelic variants, or related sequences.

Probes may also be used for the detection of related sequences, and should preferably 
have at least 50% sequence identity to any of the HGPRP encoding sequences. The hybridization
probes of the subject invention may be DNA or RNA and may be derived from the sequence of
SEQ ID NO:7-12 or from genomic sequences including promoters, enhancers, and introns of the 
HGPRP gene. 
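
The 50% sequence identity criterion mentioned above can be illustrated with a minimal Python
sketch. It assumes, for illustration only, that the two sequences have already been aligned to
equal length with "-" marking gaps, and that identity is counted as matching columns divided by
the alignment length; other conventions exist, and the function names and example sequences are
hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be pre-aligned, non-empty, and of equal length")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    # A column counts as a match only if the bases agree and neither is a gap.
    matches = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_probe_threshold(aligned_a: str, aligned_b: str, threshold: float = 50.0) -> bool:
    """True if the pair reaches the (assumed) minimum percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical aligned fragments: 6 matching columns out of 8.
print(percent_identity("ATGCCGTA", "ATGACG-A"))  # 75.0
```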

Means for producing specific hybridization probes for DNAs encoding HGPRP include
the cloning of polynucleotide sequences encoding HGPRP or HGPRP derivatives into vectors for
the production of mRNA probes. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate 
RNA polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled 
by a variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic
labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and
the like.

Polynucleotide sequences encoding HGPRP may be used for the diagnosis of disorders 
associated with expression of HGPRP. Examples of such disorders include, but are not limited to, 
a cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia; cancers including
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an
immune disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult
respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma,
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema 
nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease,
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer,
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders,
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker
syndrome; fatal familial insomnia, nutritional and metabolic diseases of the
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the
central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system 
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis;
inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic paralysis;
mental disorders including mood, anxiety, and schizophrenic disorders; akathesia, amnesia,
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder. The polynucleotide sequences encoding HGPRP may be used
in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing 
fluids or tissues from patients to detect altered HGPRP expression. Such qualitative or 
quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HGPRP may be useful in assays
that detect the presence of associated disorders, particularly those mentioned above. The
nucleotide sequences encoding HGPRP may be labeled by standard methods and added to a fluid
or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. If the amount of signal in the patient sample is significantly
altered in comparison to a control sample, then the presence of altered levels of nucleotide
sequences encoding HGPRP in the sample indicates the presence of the associated disorder. Such
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of
HGPRP, a normal or standard profile for expression is established. This may be accomplished by
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding HGPRP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially
purified polynucleotide is used. Standard values obtained in this manner may be compared with
values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 
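
The comparison of a patient value against standard values can be sketched, purely for
illustration, as follows. The sketch assumes that the standard profile is summarized by the mean
and sample standard deviation of signals from normal subjects, and that a deviation of more than
two standard deviations is treated as significant. Neither the statistic nor the cutoff is
specified in this description, and the function name and signal values are hypothetical.

```python
from statistics import mean, stdev


def deviates_from_standard(normal_values, patient_value, z_cutoff=2.0):
    """Return (flag, z): whether the patient value lies beyond the assumed cutoff."""
    mu = mean(normal_values)
    sd = stdev(normal_values)  # sample standard deviation; needs at least two values
    if sd == 0:
        raise ValueError("normal values show no variation; z-score is undefined")
    z = (patient_value - mu) / sd
    return abs(z) > z_cutoff, z


# Hypothetical hybridization signals (arbitrary units) from normal subjects.
normal_signals = [98.0, 102.0, 100.0, 101.0, 99.0]
flagged, z = deviates_from_standard(normal_signals, patient_value=110.0)
print(flagged, round(z, 2))
```

Repeating the same comparison on successive samples gives one simple way to follow whether a
patient value moves back toward the range observed in normal subjects.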

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in
the patient begins to approximate that which is observed in the normal subject. The results
obtained from successive assays may be used to show the efficacy of treatment over a period 
ranging from several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the
development of the disease, or may provide a means for detecting the disease prior to the
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding
HGPRP may involve the use of PCR. These oligomers may be chemically synthesized, generated
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a
polynucleotide encoding HGPRP, or a fragment of a polynucleotide complementary to the
polynucleotide encoding HGPRP, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less
stringent conditions for detection or quantitation of closely related DNA or RNA sequences.

Methods which may also be used to quantitate the expression of HGPRP include
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236.) The speed of 
quantitation of multiple samples may be accelerated by running the assay in an ELISA format
where the oligomer of interest is presented in various dilutions and a spectrophotometric or 
colorimetric response gives rapid quantitation. 
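
Interpolation from a standard curve, as mentioned above, can be sketched in Python as follows.
The sketch assumes piecewise-linear interpolation between bracketing standards, which is only one
of several curve-fitting choices; the function name and the standard values are hypothetical.

```python
def interpolate_from_standard_curve(standards, signal):
    """Estimate an amount from a signal using (known_amount, measured_signal) standards."""
    points = sorted(standards, key=lambda p: p[1])  # order standards by measured signal
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return a0
            # Linear interpolation between the two bracketing standards.
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (amount in ng, absorbance).
curve = [(0.0, 0.0), (10.0, 0.5), (20.0, 1.0)]
print(interpolate_from_standard_curve(curve, 0.75))  # 15.0
```

Signals outside the measured range are rejected rather than extrapolated, since the response of
a colorimetric assay is not assumed to stay linear beyond the standards.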

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The
microarray can be used to monitor the expression level of large numbers of genes simultaneously
and to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See,
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl.
Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116;
Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad.
Sci. USA 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.)

In another embodiment of the invention, nucleic acid sequences encoding HGPRP may be
used to generate hybridization probes useful in mapping the naturally occurring genomic
sequence. The sequences may be mapped to a particular chromosome, to a specific region of a
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997)
Nat. Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends
Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the
location of the gene encoding HGPRP on a physical chromosomal map and a specific disorder, or 
a predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such
as linkage analysis using established chromosomal markers, may be used for extending genetic
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical 
mapping. This provides valuable information to investigators searching for disease genes using
positional cloning or other gene discovery techniques. Once the disease or syndrome has been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
11q22-23, any sequences mapping to that area may represent associated or regulatory genes for
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide
sequence of the subject invention may also be used to detect differences in the chromosomal
location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, HGPRP, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution,
affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of
binding complexes between HGPRP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method, large numbers of different small test
compounds are synthesized on a solid substrate. The test compounds are reacted with HGPRP, or
fragments thereof, and washed. Bound HGPRP is then detected by methods well known in the art. 
Purified HGPRP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which
neutralizing antibodies capable of binding HGPRP specifically compete with a test compound for
binding HGPRP. In this manner, antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with HGPRP. 

In additional embodiments, the nucleotide sequences which encode HGPRP may be used
in any molecular biology techniques that have yet to be developed, provided the new techniques
rely on properties of nucleotide sequences that are currently known, including, but not limited to, 
such properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following preferred
specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of
the remainder of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 
in particular U.S. Ser. No. 09/156,513, are hereby expressly incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were 
homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting
lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated
from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine 
methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(Qiagen, Valencia CA), or an OLIGOTEX mRNA purification kit (Qiagen). Alternatively, RNA 
was isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE
mRNA purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), 
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel,
1997, supra, units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random
primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA
was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA
was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative
agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid
(Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids
were transformed into competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR from
Stratagene, or DH5α, DH10B, or ElectroMAX DH10B from Life Technologies.

II. Isolation of cDNA Clones

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP vector 
system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
MAGIC or WIZARD MINIPREPS DNA purification system (Promega); an AGTC MINIPREP 
purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8
Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the REAL Prep 96 plasmid kit
from Qiagen. Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and 
stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in
a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and 
stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified 
fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fluoroskan II 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis

The cDNAs were prepared for sequencing using the ABI CATALYST 800 (PE 
Biosystems) or the HYDRA microdispenser (Robbins Scientific) or MICROLAB 2200 (Hamilton) 
systems in combination with the DNA ENGINE thermal cyclers (MJ Research). The cDNAs were 
sequenced using the ABI PRISM 373 or 377 sequencing systems (PE Biosystems) and standard
ABI protocols, base calling software, and kits. In one alternative, cDNAs were sequenced using
the MEGABACE 1000 DNA sequencing system (Amersham Pharmacia Biotech). In another 
alternative, the cDNAs were amplified and sequenced using the ABI PRISM BIGDYE Terminator 
cycle sequencing ready reaction kit (PE Biosystems). In yet another alternative, cDNAs were 
sequenced using solutions and dyes from Amersham Pharmacia Biotech. Reading frames for the
ESTs were determined using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7).
Some of the cDNA sequences were selected for extension using the techniques disclosed in 
Example V. 

The polynucleotide sequences derived from cDNA, extension, and shotgun sequencing 
were assembled and analyzed using a combination of software programs which utilize algorithms
well known to those skilled in the art. Table 5 summarizes the software programs, descriptions,
references, and threshold parameters used. The first column of Table 5 shows the tools, programs, 
and algorithms used, the second column provides a brief description thereof, the third column 
presents the references which are incorporated by reference herein, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate
the strength of a match between two sequences (the higher the probability the greater the
homology). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software
Engineering, S. San Francisco CA) and LASERGENE software (DNASTAR). 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programming, and dinucleotide nearest neighbor analysis. The sequences were then
queried against a selection of public databases such as GENBANK primate, rodent, mammalian, 
vertebrate, and eukaryote databases, and BLOCKS to acquire annotation, using programs based 
on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide 
sequences using programs based on Phred, Phrap, and Consed, and were screened for open 

reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid 
sequences, and these full length sequences were subsequently analyzed by querying against 
databases such as the GENBANK databases (described above), SWISSPROT, BLOCKS, 
PRINTS, PFAM, and PROSITE. 
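
As a rough illustration of the translation and open reading frame screening step, the following minimal sketch translates a nucleotide string with the standard genetic code and reports the longest ATG-initiated forward-strand reading frame. It is not GeneMark, BLAST, or FASTA, which are the programs named above; the function names `translate` and `longest_orf` are hypothetical, and the naive scan is assumed only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate from position 0 until the first stop codon or the end of seq.

    Incomplete trailing codons are ignored; unknown codons become 'X'.
    """
    seq = seq.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def longest_orf(seq: str) -> tuple:
    """Return (start_index, protein) for the longest ATG-initiated frame.

    Only the forward strand is scanned (hypothetical simplification); a frame
    that runs off the end of seq without a stop codon is still reported.
    """
    seq = seq.upper()
    best = (-1, "")
    for start in range(len(seq) - 2):
        if seq[start:start + 3] == "ATG":
            protein = translate(seq[start:])
            if len(protein) > len(best[1]):
                best = (start, protein)
    return best
```

A call such as `longest_orf("CCATGGCCATTCACAAGTAAGG")` returns the start offset of the reading frame together with its one-letter translation, which could then be queried against protein databases.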

The programs described above for the assembly and analysis of full length polynucleotide 

and amino acid sequences were also used to identify polynucleotide sequence fragments from 
SEQ ID NO:7-12. Fragments from about 20 to about 4000 nucleotides which are useful in 
hybridization and amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; 
Ausubel, 1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 

related molecules in nucleotide databases such as GENBANK or LIFESEQ database (Incyte 
Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In 
addition, the sensitivity of the computer search can be modified to determine whether any 
particular match is categorized as exact or similar. The basis of the search is the product score, 
which is defined as: 

(% sequence identity x % maximum BLAST score) / 100 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact 
within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar 
molecules are usually identified by selecting those which show product scores between 15 and 40, 
although lower scores may identify related molecules. 
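
The product score defined above can be sketched in a few lines. The function names and the category labels returned by `classify_match` are hypothetical; the thresholds (70, 40, and 15) are taken from the text, and treating them as hard cut-offs is an assumption made only for illustration.

```python
def product_score(percent_identity: float, percent_max_blast_score: float) -> float:
    """Return (% sequence identity x % maximum BLAST score) / 100."""
    for name, value in (
        ("percent_identity", percent_identity),
        ("percent_max_blast_score", percent_max_blast_score),
    ):
        if not 0.0 <= value <= 100.0:
            raise ValueError(f"{name} must lie between 0 and 100, got {value}")
    return percent_identity * percent_max_blast_score / 100.0


def classify_match(score: float) -> str:
    """Map a product score to a label (hypothetical labels, thresholds from the text)."""
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"
```

For example, a match with 80% identity and 50% of the maximum BLAST score gives a product score of 40.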

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding HGPRP occurred. Analysis involved the categorization of cDNA libraries 
by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease categories included cancer, inflammation/trauma, fetal, 
neurological, and pooled. For each category, the number of libraries expressing the sequence of 
interest was counted and divided by the total number of libraries across all categories. Percentage 
values of tissue-specific and disease expression are reported in Table 3. 
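
The percentage calculation described above can be sketched as follows. The sketch assumes that each library expressing the transcript carries a single category label and that the denominator is the total number of expressing libraries across all categories; the function name and the example labels are hypothetical.

```python
from collections import Counter


def percentage_distribution(library_categories):
    """Return {category: percent of expressing libraries in that category}.

    library_categories holds one category label per library in which the
    transcript was found (assumed input format).
    """
    total = len(library_categories)
    if total == 0:
        return {}
    counts = Counter(library_categories)
    return {category: 100.0 * n / total for category, n in counts.items()}
```

With four expressing libraries labeled nervous, nervous, reproductive, and cardiovascular, the nervous category would account for 50% of the distribution.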

V. Extension of HGPRP Encoding Polynucleotides 

The full length nucleic acid sequence of SEQ ID NO:7-12 was produced by extension of 
an appropriate fragment of the full length molecule using oligonucleotide primers designed from 
this fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the 
other primer, to initiate 3' extension of the known fragment. The initial primers were designed 

using OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 
22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 
target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which 
would result in hairpin structures and primer-primer dimerizations was avoided. 
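
Two of the primer design criteria above, length and GC content, lend themselves to a simple check. The sketch below is not OLIGO 4.06; the function names are hypothetical, and annealing temperature, hairpin, and primer-dimer screening are deliberately left out.

```python
def gc_fraction(primer: str) -> float:
    """Return the fraction of G and C bases in the primer."""
    primer = primer.upper()
    if not primer:
        raise ValueError("primer must not be empty")
    return sum(base in "GC" for base in primer) / len(primer)


def meets_length_and_gc(primer: str, min_len: int = 22, max_len: int = 30,
                        min_gc: float = 0.50) -> bool:
    """Check the 22 to 30 nucleotide length and the GC content of 50% or more."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc
```

A 24-nucleotide primer with twelve G or C bases passes both checks, while a primer shorter than 22 nucleotides is rejected regardless of its composition.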

Selected human cDNA libraries were used to extend the sequence. If more than one 

extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. 
PCR was performed in 96-well plates using the DNA ENGINE thermal cycler (MJ Research). 
The reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing 
Mg2+, (NH4)2SO4, and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 
following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; 
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 
68°C, 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ 
were as follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. 
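
The two cycling programs above differ only in the Step 3 annealing temperature, so they can be represented by one parameterized structure. The sketch below is illustrative only: the class and function names are hypothetical, "repeated 20 times" is assumed to mean one initial pass through Steps 2 to 4 plus 20 repeats, and the open-ended 4°C storage step is omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int   # block temperature in degrees Celsius
    seconds: int  # hold time in seconds


def build_program(anneal_temp_c: int, repeats: int = 20) -> list:
    """Expand the extension PCR program into a flat list of timed steps."""
    initial = Step(94, 180)                      # Step 1: 94 C, 3 min
    cycle = [
        Step(94, 15),                            # Step 2: 94 C, 15 sec
        Step(anneal_temp_c, 60),                 # Step 3: annealing, 1 min
        Step(68, 120),                           # Step 4: 68 C, 2 min
    ]
    final = Step(68, 300)                        # Step 6: 68 C, 5 min
    # Step 5: one initial pass plus `repeats` repeats (assumed reading).
    return [initial] + cycle * (1 + repeats) + [final]


def total_seconds(program: list) -> int:
    """Sum the hold times, ignoring ramp times between temperatures."""
    return sum(step.seconds for step in program)
```

Calling `build_program(60)` corresponds to primer pair PCI A and PCI B, and `build_program(57)` to primer pair T7 and SK+.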
The concentration of DNA in each well was determined by dispensing 100 µl 
PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) 
dissolved in 1X TE and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter 
plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was 

scanned in a Fluoroskan II (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the 
sample and to quantify the concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture 
was analyzed by electrophoresis on a 1% agarose mini-gel to determine which reactions were 
successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 

(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in 
restriction site overhangs, and transfected into competent E. coli cells. Transformed cells were 
selected on antibiotic-containing media, and individual colonies were picked and cultured overnight at 
37°C in 384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 

(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 

diluted with 20% dimethylsulphoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 

In like manner, the nucleotide sequence of SEQ ID NO:7-12 is used to obtain 5' 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, 

and an appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:7-12 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide 

fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of 
[γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10^7 counts per minute of the labeled probe is used in a typical 
membrane-based hybridization analysis of human genomic DNA digested with one of the following 
endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (NYTRAN PLUS, Schleicher & Schuell, Durham NH). Hybridization is carried out 
for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room 
temperature under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% 
sodium dodecyl sulfate. After XOMAT-AR film (Eastman Kodak, Rochester NY) is exposed to 
the blots for several hours, hybridization patterns are compared. 

VII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array 

elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a 
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by 
hand or using available methods and machines and contain any appropriate number of elements. 

After hybridization, nonhybridized probes are removed and a scanner is used to determine the levels 
and patterns of fluorescence. The degree of complementarity and the relative abundance of each 
probe which hybridizes to an element on the microarray may be assessed through analysis of the 
scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
comprise the elements of the microarray. Fragments suitable for hybridization can be selected 
using software well known in the art such as LASERGENE software (DNASTAR). Full-length 
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the 
present invention, or selected at random from a cDNA library relevant to the present invention, are 
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., 
UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) 
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The 
substrate is analyzed by procedures described above. 

VIII. Complementary Polynucleotides 

Sequences complementary to the HGPRP-encoding sequences, or any parts thereof, are 

used to detect, decrease, or inhibit expression of naturally occurring HGPRP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides 
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of 

HGPRP. To inhibit transcription, a complementary oligonucleotide is designed from the most 
unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit 
translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the 
HGPRP-encoding transcript. 
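
Deriving a complementary oligonucleotide from a stretch of coding sequence amounts to taking its reverse complement. The following sketch is not OLIGO 4.06; the function names and the example input are hypothetical, and the 15 to 30 nucleotide bound simply mirrors the range mentioned above.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_seq: str, start: int, length: int = 20) -> str:
    """Return an oligonucleotide complementary to coding_seq[start:start+length]."""
    if not 15 <= length <= 30:
        raise ValueError("length is expected to be between 15 and 30 nucleotides")
    target = coding_seq[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the coding sequence")
    return reverse_complement(target)
```

Choosing the target window, for example the most unique 5' region or a ribosome binding region, is left to the user; the sketch only performs the complementation.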

IX. Expression of HGPRP 

Expression and purification of HGPRP is achieved using bacterial or virus-based 

expression systems. For expression of HGPRP in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels 
of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac 
(tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac 

operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, 
e.g., BL21(DE3). Antibiotic resistant bacteria express HGPRP upon induction with isopropyl 
beta-D-thiogalactopyranoside (IPTG). Expression of HGPRP in eukaryotic cells is achieved by 
infecting insect or mammalian cell lines with recombinant Autographa californica nuclear 
polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin 

gene of baculovirus is replaced with cDNA encoding HGPRP by either homologous 

recombination or bacterial-mediated transposition involving transfer plasmid intermediates. Viral 
infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA 
transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells 
in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional 

genetic modifications to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. 
USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, HGPRP is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, 
single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. 

GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion 
proteins on immobilized glutathione under conditions that maintain protein activity and 
antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be 
proteolytically cleaved from HGPRP at specifically engineered sites. FLAG, an 8-amino acid 
peptide, enables immunoaffinity purification using commercially available monoclonal and 
polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine 
residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression 
and purification are discussed in Ausubel (1995, supra, ch. 10 and 16). Purified HGPRP obtained 
by these methods can be used directly in the following activity assay. 

X. Demonstration of HGPRP Activity 

GPCR activity of HGPRP is determined in a ligand-binding assay using candidate ligand 
molecules in the presence of 125I-labeled HGPRP. HGPRP is labeled with 125I Bolton-Hunter 
reagent. (See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate 
ligand molecules previously arrayed in the wells of a multi-well plate are incubated with the 
labeled HGPRP, washed, and any wells with labeled HGPRP complex are assayed. Data obtained 
using different concentrations of HGPRP are used to calculate values for the number, affinity, and 
association of HGPRP with the ligand molecules. 
XI. Functional Assays 

HGPRP function is assessed by expressing the sequences encoding HGPRP at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 

mammalian expression vector containing a strong promoter that drives high levels of cDNA 
expression. Vectors of choice include pCMV SPORT plasmid (Life Technologies) and pCR3.1 
plasmid (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg 
of recombinant vector are transiently transfected into a human cell line, preferably of endothelial 
or hematopoietic origin, using either liposome formulations or electroporation. 1-2 µg of an 
additional plasmid containing sequences encoding a marker protein are co-transfected. Expression 
of a marker protein provides a means to distinguish transfected cells from nontransfected cells and 
is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of 
choice include, e.g., Green Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion 
protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify 

transfected cells expressing GFP or CD64-GFP, and to evaluate properties, for example, their 
apoptotic state. FCM detects and quantifies the uptake of fluorescent molecules that diagnose 
events preceding or coincident with cell death. These events include changes in nuclear DNA 
content as measured by staining of DNA with propidium iodide; changes in cell size and 
granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation 

of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific 
antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY. 

The influence of HGPRP on gene expression can be assessed using highly purified 

populations of cells transfected with sequences encoding HGPRP and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved 
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from 
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 

(DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known 
by those of skill in the art. Expression of mRNA encoding HGPRP and other genes of interest can 
be analyzed by northern analysis or microarray techniques. 

XII. Production of HGPRP Specific Antibodies 

HGPRP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is 
used to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the HGPRP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are 
well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431A peptide 
synthesizer (PE Biosystems) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide activity 
by, for example, binding the peptide to plastic, blocking with 1% BSA, reacting with rabbit 
antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIII. Purification of Naturally Occurring HGPRP Using Specific Antibodies 

Naturally occurring or recombinant HGPRP is substantially purified by immunoaffinity 
chromatography using antibodies specific for HGPRP. An immunoaffinity column is constructed 
by covalently coupling anti-HGPRP antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing HGPRP are passed over the immunoaffinity column, and the column is 

washed under conditions that allow the preferential absorbance of HGPRP (e.g., high ionic 
strength buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/HGPRP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and HGPRP is collected. 

XIV. Identification of Molecules Which Interact with HGPRP 

HGPRP, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter 
reagent. (See, e.g., Bolton and Hunter, supra .) Candidate molecules previously arrayed in the 
wells of a multi-well plate are incubated with the labeled HGPRP, washed, and any wells with 
labeled HGPRP complex are assayed. Data obtained using different concentrations of HGPRP are 

used to calculate values for the number, affinity, and association of HGPRP with the candidate 
molecules. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 
to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
intended to be within the scope of the following claims. 
What is claimed is: 



1. A substantially purified polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, and 
SEQ ID NO:6, and fragments thereof. 

2. A substantially purified variant having at least 90% amino acid sequence identity 
to the amino acid sequence of claim 1. 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1. 

4. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent 
conditions to the polynucleotide of claim 3. 

6. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps of: 

(a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in a 
sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide prior to 
hybridization. 

9. An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO:10, and SEQ ID NO:12, and fragments thereof. 

10. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 9. 

11. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 9. 

12. An expression vector comprising at least a fragment of the polynucleotide of 
claim 3. 

13. A host cell comprising the expression vector of claim 12. 

14. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 13 under conditions suitable for the 

expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 
15. A pharmaceutical composition comprising the polypeptide of claim 1 in 
conjunction with a suitable pharmaceutical carrier. 

16. A purified antibody which specifically binds to the polypeptide of claim 1. 

17. A purified agonist of the polypeptide of claim 1. 

18. A purified antagonist of the polypeptide of claim 1. 

19. A method for treating or preventing a disorder associated with decreased 
expression or activity of HGPRP, the method comprising administering to a subject in need of 
such treatment an effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased 
expression or activity of HGPRP, the method comprising administering to a subject in need of 
such treatment an effective amount of the antagonist of claim 18. 
<110> INCYTE PHARMACEUTICALS, INC. 
BANDMAN, Olga 
LAL, Preeti 
TANG, Y. Tom 
CORLEY, Neil C. 
GUEGLER, Karl J. 
GORGONE, Gina A. 
BAUGHN, Mariah R. 

<120> HUMAN GPCR PROTEINS 

<130> PF-0597 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 09/156,513 
<151> 1998-09-17 

<160> 12 

<170> PERL Program 



<210> 1 

<211> 441 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1258981CD1 



<400> 1 








Val 


Met Ala 


He His Lys 

c 


Ala 


Leu 


1 

Phe Leu 


Phe Pro Gly Ala Trp 


Ala 




20 








Cys Ser 


Gin Gly Leu 


Asn 


Pro 


Leu 


35 








Ser Gly 


Ala Trp Gly 


He 


Val 


Leu 




50 








He Val 


Thr Thr Phe 


Val 


Leu 


Thr 




65 








Pro Phe 


Val Gin Asp 


Thr 


Lys 


Lys 




80 








Val Phe 


Phe Leu Leu Gly 


Thr 


Leu 




95 








Ala Cys 


Val Val Lys 


Pro Asp 


Phe 




110 








Phe Leu 


Phe Gly Val 


Leu 


Phe 


Ala 




125 








Ala His 


Val Phe Ala 


Leu 


Asn 


Phe 




140 








Pro Arg 


Gly Trp Val 


He 


Phe 


Thr 



Met 


Cys Leu Gly 


Leu Pro Leu 




10 


15 


Gin 


Gly His Val 


Pro Pro Gly 




25 


30 


Tyr 


Tyr Asn Leu 


Cys Asp Arg 


40 


45 


Glu 


Ala Val Ala Gly Ala Gly 




55 


60 


He 


He Leu Val 


Ala Ser Leu 




70 


75 


Arg 


Ser Leu Leu Gly Thr Gin 


85 


90 


Gly 


Leu Phe Cys 


Leu Val Phe 


100 


105 


Ser 


Thr Cys Ala 


Ser Arg Arg 




115 


120 


He 


Cys Phe Ser 


Cys Leu Ala 




130 


135 


Leu 


Ala Arg Lys 


Asn His Gly 




145 


150 


val 


Ala Leu Leu 


Leu Thr Leu 
155 








Val Glu Val 


He 


He 


Asn 


Thr 


Glu 






170 








Arg Gly Ser Gly 


Glu Gly 


Gly Pro 






185 








Trp Ala Val Ala 


Ser 


Pro 


Cys 


Ala 






200 








Met Ala Leu 


He 


Tyr 


Val 


Met 


Leu 






215 








Gly Ala Trp 


Pro 


Ala 


Leu 


Cys Gly 






230 








His Gly val 


Phe 


Val 


Leu 


Leu 


Thr 






245 








Trp Val Val 


Trp 


He 


Val 


Met 


Tyr 






260 








Asn Ser Pro 


Thr 


Trp 


Asp 


Asp 


Pro 






275 








Ala Asn Ala 


Trp 


Ala 


Phe 


Val 


Leu 






290 








Ser Gin Val 


Thr 


Lys 


Ser 


Ser 


Pro 






305 








Met Tyr Pro 


Thr 


Arg 


Gly 


Val 


Gly 






320 








Gin Lys Gly Gin 


Ser 


Met 


Phe 


Val 






335 








Asp Glu Pro 


Val 


Ala 


Ala 


Lys Arg 




350 








Tyr Asn Gly Gin 


Leu 


Leu 


Thr 


Ser 






365 








Ala Leu Met 


His 


Lys 


Val 


Pro 


Ser 






380 








Leu Pro Arg 


Ala 


Thr 


Ala 


Asn 


Ser 






395 








Ser Thr Leu Arg 


Ala 


Glu 


Asp 


Met 






410 








Ala Ala Thr 


Pro 


Pro 


Lys 


Asp Gly 






425 








Asn Pro Tyr 


Val 


Trp Asp 










440 









160 




165 


Trp Leu He He Thr 


Leu 


Val 


175 




180 


Gin Gly Asn Ser Ser Ala Gly 


190 




195 

2. J -J 


He Ala Asn raet. 


Phe 


Val 


205 




210 


Leu Leu Leu uiy Aia 


rue 


Leu 


220 




225 


Arg Tyr .Lys nrg up 


7\ yrr 


Lys 


235 




240 


Thr Ala Thr Ser Val 


Ala 


He 


250 




255 


Thr Tyr Gly Asn Lys 


Gin 


His 


265 




270 


Thr Leu Ala He Ala 


Leu 


Ala 


280 




285 


Phe Tyr Val He Pro 


Glu 


Val 


295 




300 


Glu Gin Ser Tyr Gin Gly Asp 


310 




315 


Tyr Glu Thr He Leu 


Lys 


Glu 


325 




330 


Glu Asn Lys Ala Phe 


Ser 


Met 


340 




345 


Pro Val Ser Pro Tyr 


Ser 


Gly 


355 




360 


Val Tyr Gin Pro Thr 


Glu 


Met 


370 




375 


Glu Gly Ala Tyr Asp 


He 


He 


385 




390 


Gin Val Met Gly Ser 


Ala 


Asn 


400 




405 


Tyr Ser Ala Gin Ser 


His 


Gin 


415 




420 


Lys Asn Ser Gin Val 


Phe 


Arg 


430 




435 



<210> 2 

<211> 353 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1459432CD1 



<400> 2 

Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser 
1 5 10 15 

Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro 
20 25 30 

Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser 
35 40 45 

Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr 
50 55 60 

Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn 
65 70 75 

Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu 
80 85 90 

Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly 
95 100 105 

Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr 
110 115 120 

Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr 
125 130 135 

Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser 
140 145 150 

Ser Thr Lys Phe Arg Lys Pro Ser Val Ala Thr Leu Val Ile Cys 
155 160 165 

Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu 
170 175 180 

Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly 
185 190 195 

Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu 
200 205 210 

Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr 
215 220 225 

Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val Ala 
230 235 240 

Pro Thr Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr 
245 250 255 

Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala 
260 265 270 

Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro 
275 280 285 

Thr Pro Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly 
290 295 300 

Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys 
305 310 315 

Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala 
320 325 330 

Gln Gly Gln Leu Arg Ala Val Ser Asn Ala Gln Ala Ala Asp Glu 
335 340 345 

Glu Arg Thr Glu Ser Lys Gly Thr 
350 



<210> 3 

<211> 333 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2214673CD1 



<400> 3 

Met Trp Ser Cys Ser Trp Phe Asn Gly Thr Gly Leu Val Glu Glu 
1 5 10 15 

Leu Pro Ala Cys Gln Asp Leu Gln Leu Gly Leu Ser Leu Leu Ser 
20 25 30 

Leu Leu Gly Leu Val Val Gly Val Pro Val Gly Leu Cys Tyr Asn 
35 40 45 

Ala Leu Leu Val Leu Ala Asn Leu His Ser Lys Ala Ser Met Thr 
50 55 60 

Met Pro Asp Val Tyr Phe Val Asn Met Ala Val Ala Gly Leu Val 
65 70 75 

Leu Ser Ala Leu Ala Pro Val His Leu Leu Gly Pro Pro Ser Ser 
80 85 90 

Arg Trp Ala Leu Trp Ser Val Gly Gly Glu Val His Val Ala Leu 
95 100 105 

Gln Ile Pro Phe Asn Val Ser Ser Leu Val Ala Met Tyr Ser Thr 
110 115 120 

Ala Leu Leu Ser Leu Asp His Tyr Ile Glu Arg Ala Leu Pro Arg 
125 130 135 

Thr Tyr Met Ala Ser Val Tyr Asn Thr Arg His Val Cys Gly Phe 
140 145 150 

Val Trp Gly Gly Ala Leu Leu Thr Ser Phe Ser Ser Leu Leu Phe 
155 160 165 

Tyr Ile Cys Ser His Val Ser Thr Arg Ala Leu Glu Cys Ala Lys 
170 175 180 

Met Gln Asn Ala Glu Ala Ala Asp Ala Thr Leu Val Phe Ile Gly 
185 190 195 

Tyr Val Val Pro Ala Leu Ala Thr Leu Tyr Ala Leu Val Leu Leu 
200 205 210 

Ser Arg Val Arg Arg Glu Asp Thr Pro Leu Asp Arg Asp Thr Gly 
215 220 225 

Arg Leu Glu Pro Ser Ala His Arg Leu Leu Val Ala Thr Val Cys 
230 235 240 

Thr Gln Phe Gly Leu Trp Thr Pro His Tyr Leu Ile Leu Leu Gly 
245 250 255 

His Thr Gly Ile Ile Ser Arg Gly Lys Pro Val Asp Ala His Tyr 
260 265 270 

Leu Gly Leu Leu His Phe Val Lys Asp Phe Ser Lys Leu Leu Ala 
275 280 285 

Phe Ser Ser Ser Phe Val Thr Pro Leu Leu Tyr Arg Tyr Met Asn 
290 295 300 

Gln Ser Phe Pro Ser Lys Leu Gln Arg Leu Met Lys Lys Leu Pro 
305 310 315 

Cys Gly Asp Arg His Cys Ser Pro Asp His Met Gly Val Gln Gln 
320 325 330 

Val Leu Ala 



<210> 4 

<211> 396 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2488822CD1 



<400> 4 

Met Phe Arg Pro Leu Val Asn Leu Ser His Ile Tyr Phe Lys Lys
1 5 10 15

Phe Gln Tyr Cys Gly Tyr Ala Pro His Val Arg Ser Cys Lys Pro
20 25 30

Asn Thr Asp Gly Ile Ser Ser Leu Glu Asn Leu Leu Ala Ser Ile
35 40 45

Ile Gln Arg Val Phe Val Trp Val Val Ser Ala Val Thr Cys Phe
50 55 60

Gly Asn Ile Phe Val Ile Cys Met Arg Pro Tyr Ile Arg Ser Glu
65 70 75

Asn Lys Leu Tyr Ala Met Ser Ile Ile Ser Leu Cys Cys Ala Asp
80 85 90

Cys Leu Met Gly Ile Tyr Leu Phe Val Ile Gly Gly Phe Asp Leu
95 100 105

Lys Phe Arg Gly Glu Tyr Asn Lys His Ala Gln Leu Trp Met Glu
110 115 120

Ser Thr His Cys Gln Leu Val Gly Ser Leu Ala Ile Leu Ser Thr
125 130 135

Glu Val Ser Val Leu Leu Leu Thr Phe Leu Thr Leu Glu Lys Tyr
140 145 150

Ile Cys Ile Val Tyr Pro Phe Arg Cys Val Arg Pro Gly Lys Cys
155 160 165

Arg Thr Ile Thr Val Leu Ile Leu Ile Trp Ile Thr Gly Phe Ile
170 175 180

Val Ala Phe Ile Pro Leu Ser Asn Lys Glu Phe Phe Lys Asn Tyr
185 190 195

Tyr Ala Pro Asn Gly Val Cys Phe Pro Leu His Ser Glu Asp Thr
200 205 210

Glu Ser Ile Gly Ala Gln Ile Tyr Ser Val Ala Ile Phe Leu Gly
215 220 225

Ile Asn Leu Ala Ala Phe Ile Ile Ile Val Phe Ser Tyr Gly Ser
230 235 240

Met Phe Tyr Ser Val His Gln Ser Ala Ile Thr Ala Thr Glu Ile
245 250 255

Arg Asn Gln Val Lys Lys Glu Met Ile Leu Ala Lys Arg Phe Phe
260 265 270

Phe Ile Val Phe Thr Asp Ala Leu Cys Trp Ile Pro Ile Phe Val
275 280 285

Val Lys Phe Leu Ser Leu Leu Gln Val Glu Ile Pro Gly Thr Ile
290 295 300

Thr Ser Trp Val Val Ile Phe Ile Leu Pro Ile Asn Ser Ala Leu
305 310 315

Asn Pro Ile Leu Tyr Thr Leu Thr Thr Arg Pro Phe Lys Glu Met
320 325 330

Ile His Arg Phe Trp Tyr Asn Tyr Arg Gln Arg Lys Ser Met Asp
335 340 345

Ser Lys Gly Gln Lys Thr Tyr Ala Pro Ser Phe Ile Trp Val Glu
350 355 360

Met Trp Pro Leu Gln Glu Met Pro Pro Glu Leu Met Lys Pro Asp
365 370 375

Leu Phe Thr Tyr Pro Cys Glu Met Ser Leu Ile Ser Gln Ser Thr
380 385 390

Arg Leu Asn Ser Tyr Ser
395

<210> 5

<211> 403

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature 

<223> Incyte ID No: 2705201CD1 

<400> 5 

Met Phe Val Ala Ser Glu Arg Lys Met Arg Ala His Gln Val Leu
1 5 10 15

Thr Phe Leu Leu Leu Phe Val Ile Thr Ser Val Ala Ser Glu Asn
20 25 30

Ala Ser Thr Ser Arg Gly Cys Gly Leu Asp Leu Leu Pro Gln Tyr
35 40 45

Val Ser Leu Cys Asp Leu Asp Ala Ile Trp Gly Ile Val Val Glu
50 55 60

Ala Val Ala Gly Ala Gly Ala Leu Ile Thr Leu Leu Leu Met Leu
65 70 75

Ile Leu Leu Val Arg Leu Pro Phe Ile Lys Glu Lys Glu Lys Lys
80 85 90

Ser Pro Val Gly Leu His Phe Leu Phe Leu Leu Gly Thr Leu Gly
95 100 105

Leu Phe Gly Leu Thr Phe Ala Phe Ile Ile Gln Glu Asp Glu Thr
110 115 120

Ile Cys Ser Val Arg Arg Phe Leu Trp Gly Val Leu Phe Ala Leu
125 130 135

Cys Phe Ser Cys Leu Leu Ser Gln Ala Trp Arg Val Arg Arg Leu
140 145 150

Val Arg His Gly Thr Gly Pro Ala Gly Trp Gln Leu Val Gly Leu
155 160 165

Ala Leu Cys Leu Met Leu Val Gln Val Ile Ile Ala Val Glu Trp
170 175 180

Leu Val Leu Thr Val Leu Arg Asp Thr Arg Pro Ala Cys Ala Tyr
185 190 195

Glu Pro Met Asp Phe Val Met Ala Leu Ile Tyr Asp Met Val Leu
200 205 210

Leu Val Val Thr Leu Gly Leu Ala Leu Phe Thr Leu Cys Gly Lys
215 220 225

Phe Lys Arg Trp Lys Leu Asn Gly Ala Phe Leu Leu Ile Thr Ala
230 235 240

Phe Leu Ser Val Leu Ile Trp Val Ala Trp Met Thr Met Tyr Leu
245 250 255

Phe Gly Asn Val Lys Leu Gln Gln Gly Asp Ala Trp Asn Asp Pro
260 265 270

Thr Leu Ala Ile Thr Leu Ala Ala Ser Gly Trp Val Phe Val Ile
275 280 285

Phe His Ala Ile Pro Glu Ile His Cys Thr Leu Leu Pro Ala Leu
290 295 300

Gln Glu Asn Thr Pro Asn Tyr Phe Asp Thr Ser Gln Pro Arg Met
305 310 315

Arg Glu Thr Ala Phe Glu Glu Asp Val Gln Leu Pro Arg Ala Tyr
320 325 330

Met Glu Asn Lys Ala Phe Ser Met Asp Glu His Asn Ala Ala Leu
335 340 345

Arg Thr Ala Gly Phe Pro Asn Gly Ser Leu Gly Lys Arg Pro Ser
350 355 360

Gly Ser Leu Gly Lys Arg Pro Ser Ala Pro Phe Arg Ser Asn Val
365 370 375

Tyr Gln Pro Thr Glu Met Ala Val Val Leu Asn Gly Gly Thr Ile
380 385 390

Pro Thr Ala Pro Pro Ser His Thr Gly Arg His Leu Trp
395 400

<210> 6 

<211> 807 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3036563CD1 

<400> 6 

Met Gly Thr Tyr His Cys Ile Phe Arg Tyr Lys Asn Ser Tyr Ser
1 5 10 15

Ile Ala Thr Lys Asp Val Ile Val His Pro Leu Pro Leu Lys Leu
20 25 30

Asn Ile Met Val Asp Pro Leu Glu Ala Thr Val Ser Cys Ser Gly
35 40 45

Ser His His Ile Lys Cys Cys Ile Glu Glu Asp Gly Asp Tyr Lys
50 55 60

Val Thr Phe His Met Gly Ser Ser Ser Leu Pro Ala Ala Lys Glu
65 70 75

Val Asn Lys Lys Gln Val Cys Tyr Lys His Asn Phe Asn Ala Ser
80 85 90

Ser Val Ser Trp Cys Ser Lys Thr Val Asp Val Cys Cys His Phe
95 100 105

Thr Asn Ala Ala Asn Asn Ser Val Trp Ser Pro Ser Met Lys Leu
110 115 120

Asn Leu Val Pro Gly Glu Asn Ile Thr Cys Gln Asp Pro Val Ile
125 130 135

Gly Val Gly Glu Pro Gly Lys Val Ile Gln Lys Leu Cys Arg Phe
140 145 150

Ser Asn Val Pro Ser Ser Pro Glu Ser Pro Ile Gly Gly Thr Ile
155 160 165

Thr Tyr Lys Cys Val Gly Ser Gln Trp Glu Glu Lys Arg Asn Asp
170 175 180

Cys Ile Ser Ala Pro Ile Asn Ser Leu Leu Gln Met Ala Lys Ala
185 190 195

Leu Ile Lys Ser Pro Ser Gln Asp Glu Met Leu Pro Thr Tyr Leu
200 205 210

Lys Asp Leu Ser Ile Ser Ile Gly Lys Ala Glu His Glu Ile Ser
215 220 225

Ser Ser Pro Gly Ser Leu Gly Ala Ile Ile Asn Ile Leu Asp Leu
230 235 240

Leu Ser Thr Val Pro Thr Gln Val Asn Ser Glu Met Met Thr His
245 250 255

Val Leu Ser Thr Val Asn Ile Ile Leu Gly Lys Pro Val Leu Asn
260 265 270

Thr Trp Lys Val Leu Gln Gln Gln Trp Thr Asn Gln Ser Ser Gln
275 280 285

Leu Leu His Ser Val Glu Arg Phe Ser Gln Ala Leu Gln Ser Gly
290 295 300

Asp Ser Pro Pro Leu Ser Phe Ser Gln Thr Asn Val Gln Met Ser
305 310 315

Ser Met Val Ile Lys Ser Ser His Pro Glu Thr Tyr Gln Gln Arg
320 325 330

Phe Val Phe Pro Tyr Phe Asp Leu Trp Gly Asn Val Val Ile Asp
335 340 345

Lys Ser Tyr Leu Glu Asn Leu Gln Ser Asp Ser Ser Ile Val Thr
350 355 360

Met Ala Phe Pro Thr Leu Gln Ala Ile Leu Ala Gln Asp Ile Gln
365 370 375

Glu Asn Asn Phe Ala Glu Ser Leu Val Met Thr Thr Thr Val Ser
380 385 390

His Asn Thr Thr Met Pro Phe Arg Ile Ser Met Thr Phe Lys Asn
395 400 405

Asn Ser Pro Ser Gly Gly Glu Thr Lys Cys Val Phe Trp Asn Phe
410 415 420

Arg Leu Ala Asn Asn Thr Gly Gly Trp Asp Ser Ser Gly Cys Tyr
425 430 435

Val Glu Glu Gly Asp Gly Asp Asn Val Thr Cys Ile Cys Asp His
440 445 450

Leu Thr Ser Phe Ser Ile Leu Met Ser Pro Asp Ser Pro Asp Pro
455 460 465

Ser Ser Leu Leu Gly Ile Leu Leu Asp Ile Ile Ser Tyr Val Gly
470 475 480

Val Gly Phe Ser Ile Leu Ser Leu Ala Ala Cys Leu Val Val Glu
485 490 495

Ala Val Val Trp Lys Ser Val Thr Lys Asn Arg Thr Ser Tyr Met
500 505 510

Arg His Thr Cys Ile Val Asn Ile Ala Ala Ser Leu Leu Val Ala
515 520 525

Asn Thr Trp Phe Ile Val Val Ala Ala Ile Gln Asp Asn Arg Tyr
530 535 540

Ile Leu Cys Lys Thr Ala Cys Val Ala Ala Thr Phe Phe Ile His
545 550 555

Phe Phe Tyr Leu Ser Val Phe Phe Trp Met Leu Thr Leu Gly Leu
560 565 570

Met Leu Phe Tyr Arg Leu Val Phe Ile Leu His Glu Thr Ser Arg
575 580 585

Ser Thr Gln Lys Ala Ile Ala Phe Cys Leu Gly Tyr Gly Cys Pro
590 595 600

Leu Ala Ile Ser Val Ile Thr Leu Gly Ala Thr Gln Pro Arg Glu
605 610 615

Val Tyr Thr Arg Lys Asn Val Cys Trp Leu Asn Trp Glu Asp Thr
620 625 630

Lys Ala Leu Leu Ala Phe Ala Ile Pro Ala Leu Ile Ile Val Val
635 640 645

Val Asn Ile Thr Ile Thr Ile Val Val Ile Thr Lys Ile Leu Arg
650 655 660

Pro Ser Ile Gly Asp Lys Pro Cys Lys Gln Glu Lys Ser Ser Leu
665 670 675

Phe Gln Ile Ser Lys Ser Ile Gly Val Leu Thr Pro Leu Leu Gly
680 685 690

Leu Thr Trp Gly Phe Gly Leu Thr Thr Val Phe Pro Gly Thr Asn
695 700 705

Leu Val Phe His Ile Ile Phe Ala Ile Leu Asn Val Phe Gln Gly
710 715 720

Leu Phe Ile Leu Leu Phe Gly Cys Leu Trp Asp Leu Lys Val Gln
725 730 735

Glu Ala Leu Leu Asn Lys Phe Ser Leu Ser Arg Trp Ser Ser Gln
740 745 750

His Ser Lys Ser Thr Ser Leu Gly Ser Ser Thr Pro Val Phe Ser
755 760 765

Met Ser Ser Pro Ile Ser Arg Arg Phe Asn Asn Leu Phe Gly Lys
770 775 780

Thr Gly Thr Tyr Asn Val Ser Thr Pro Glu Ala Thr Ser Ser Ser
785 790 795

Leu Glu Asn Ser Ser Ser Ala Ser Ser Leu Leu Asn
800 805

<210> 7
<211> 1824
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 1258981CB1

<400> 7

cggctcgagc cctcaccagc cggaaagtac gagtcggctc agcctggagg gacccaacca 60
gagcctggcc tgggagccag gatggccatc cacaaagcct tggtgatgtg cctgggactg 120
cctctcttcc tgttcccagg ggcctgggcc cagggccatg tcccacccgg ctgcagccaa 180
ggcctcaacc ccctgtacta caacctgtgt gaccgctctg gggcgtgggg catcgtcctg 240
gaggccgtgg ctggggcggg cattgtcacc acgtttgtgc tcaccatcat cctggtggcc 300
agcctcccct ttgtgcagga caccaagaaa cggagcctgc tggggaccca ggtattcttc 360
cttctgggga ccctgggcct cttctgcctc gtgtttgcct gtgtggtgaa gcccgacttc 420
tccacctgtg cctctcggcg cttcctcttt ggggttctgt tcgccatctg cttctcttgt 480
ctggcggctc acgtctttgc cctcaacttc ctggcccgga agaaccacgg gccccggggc 540
tgggtgatct tcactgtggc tctgctgctg accctggtag aggtcatcat caatacagag 600
tggctgatca tcaccctggt tcggggcagt ggcgagggcg gccctcaggg caacagcagc 660
gcaggctggg ccgtggcctc cccctgtgcc atcgccaaca tggactttgt catggcactc 720
atctacgtca tgctgctgct gctgggtgcc ttcctggggg cctggcccgc cctgtgtggc 780
cgctacaagc gctggcgtaa gcatggggtc tttgtgctcc tcaccacagc cacctccgtt 840
gccatatggg tggtgtggat cgtcatgtat acttacggca acaagcagca caacagtccc 900
acctgggatg accccacgct ggccatcgcc ctcgccgcca atgcctgggc cttcgtcctc 960
ttctacgtca tccccgaggt ctcccaggtg accaagtcca gcccagagca aagctaccag 1020
ggggacatgt accccacccg gggcgtgggc tatgagacca tcctgaaaga gcagaagggt 1080
cagagcatgt tcgtggagaa caaggccttt tccatggatg agccggttgc agctaagagg 1140
ccggtgtcac catacagcgg gtacaatggg cagctgctga ccagtgtgta ccagcccact 1200
gagatggccc tgatgcacaa agttccgtcc gaaggagctt acgacatcat cctcccacgg 1260
gccaccgcca acagccaggt gatgggcagt gccaactcga ccctgcgggc tgaagacatg 1320
tactcggccc agagccacca ggcggccaca ccgccgaaag acggcaagaa ctctcaggtc 1380
tttagaaacc cctacgtgtg ggactgagtc agcggtggcg aggagaggcg gtcggatttg 1440
gggagggccc tgaggacctg gccccgggca agggactctc caggctcctc ctccccctgg 1500
caggcccagc aacatgtgcc ccagatgtgg aagggcctcc ctctctgcca gtgtttgggt 1560
gggtgtcatg ggtgtcccca cccactcctc agtgtttgtg gagtcgagga gccaacccca 1620
gcctcctgcc aggatcacct cggcggtcac actccagcca aatagtgttc tcggggtggt 1680
ggctgggcag cgcctatgtt tctctggaga ttcctgcaac ctcaagagac ttcccaggcg 1740
ctcaggcctg gatcttgctc ctctgtgagg aacaagggtg cctaataaat acatttctgc 1800
tttattaact cttaaaaaaa aaaa 1824

<210> 8

<211> 2152

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1459432CB1 

<400> 8 

ttatgtctgg tcgactctga attgggcttg gaggcggcac ggctgccagg ctacggaggt 60 
agaccccctt cccaactgcg gggcttgcgc tccgggacaa ggtggcaggc gctggaggct 120 
gccgcagcct gcgtgggtgg aggggagctc agctcggttg tggcagcatg cgaccggcac 180 
tggctggatg gacctggaag cctcgctgct gcccactggt cccaatgcca gcaacacctc 240 
tgatggcccc gataacctca cttcggcagg atcacctcct cgcacgggga gcatctccta 300 
catcaacatc atcatgcctt cggtgttcgg caccatctgc ctcctgggca tcatcgggaa 360 
ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg cactggtgca acaacgtccc 420 
cgacatcttc atcatcaacc tctcggtagt agatctcctc tttctcctgg gcatgccctt 480 
catgatccac cagctcatgg gcaatggggt gtggcacttt ggggagacca tgtgcaccct 540
catcacggcc atggatgcca atagtcagtt caccagcacc tacatcctga ccgccatggc 600 
cattgaccgc tacctggcca ctgtccaccc catctcttcc acgaagttcc ggaagccctc 660 
tgtggccacc ctggtgatct gcctcctgtg ggccctctcc ttcatcagca tcacccctgt 720 
gtggctgtat gccagactca tccccttccc aggaggtgca gtgggctgcg gcatacgcct 780 
gcccaaccca gacactgacc tctactggtt caccctgtac cagtttttcc tggcctttgc 840 
cctgcctttt gtggtcatca cagccgcata cgtgaggatc ctgcagcgca tgacgtcctc 900 
agtggccccc acctcccagc gcagcatccg gctgcggaca aagagggtga cccgcacagc 960 
catcgccatc tgtctggtct tctttgtgtg ctgggcaccc tactatgtgc tacagctgac 1020 
ccagttgtcc atcagccgcc cgacccccac ctttgtctac ttatacaatg cggccatcag 1080 
cttgggctat gccaacagct gcctcaaccc gtttgtgtac atcgtgctct gtgagacgtt 1140
ccgcaaacgc ttggtcctgt cggtgaagcc tgcagcccag gggcagcttc gcgctgtcag 1200 
caacgctcag gcggctgacg aggagaggac agaaagcaaa ggcacctgat acttcccctg 1260 
ccaccctgca cacctccaag tcagggcacc acaacacgcc accgggagag atgctgagaa 1320 
aaacccaaga ccgctcggga aatgcaggaa ggccgggttg tgaggggttg ttgcaatgaa 1380 
ataaatacat tccatggggc tcacacgttg ctggggaggc ctggagtcag gtttggggtt 1440 
ttcagatatc agaaatcccc ttgggggagc aggatgagac ctttggatag aacagaagct 1500 
gagcaagaga acatgttggt ttggataacc ggttgcacta tatctgtgag ctctcaaatg 1560 
tcttcttccc aaggcaagag gtggaagggt actgactggg tttgtttaaa gtcaggcagg 1620 
gctggagtga gcagccaggg ccatgttgca caaggcctga gagacgggaa agggcccgat 1680 
cgctctttcc cgcctctcac tggtgcgatg gaaggtggcc tttctcccaa gctggtggat 1740
aatgaaaaat aaagcatccc atctctcggc gttccagcat cctgtcaatt tcccttttgc 1800 
tctagaggat gcatgtttat ttgaggggat gtggcactga gcccacagga gtaaaagccc 1860 
agtttgctag gaggtctgct tactgaaaac aaggagacct ggggtgggtg tggttggggg 1920 
tcttaaaact aataaaagct ggggtcgggg ggcttttgca gctctggtga cattctctcc 1980 
acggggcaca tttgctcagt cactaatcca gcttgagtgt ccgtgtgttc tgcatgtgca 2040 
ggggtcattc tagtgcccgg tgtgttggca tcatcttttt gctctagccc ttcctctcca 2100 
aaataaaatc aaataaagga aaatctccac ccaaaaaaaa aaaaaaaaaa gg 2152 

<210> 9 
<211> 1878 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2214673CB1 

<400> 9 

cgcacagcgc gcaggtcctc accagagctc tggtggccac ctctgtcccg ccatgctgct 60 
caccgacagt ggccagggcc cacagcacca agaggcttgg gccacaaagt aaagggtcgc 120 
ggagcctcgc cggccgccat gtggagctgc agctggttca acggcacagg gctggtggag 180 
gagctgcctg cctgccagga cctgcagctg gggctgtcac tgttgtcgct gctgggcctg 240 
gtggtgggcg tgccagtggg cctgtgctac aacgccctgc tggtgctggc caacctacac 300 
agcaaggcca gcatgaccat gccggacgtg tactttgtca acatggcagt ggcaggcctg 360
gtgctcagcg ccctggcccc tgtgcacctg ctcggccccc cgagctcccg gtgggcgctg 420
tggagtgtgg gcggcgaagt ccacgtggca ctgcagatcc ccttcaatgt gtcctcactg 480
gtggccatgt actccaccgc cctgctgagc ctcgaccact acatcgagcg tgcactgccg 540
cggacctaca tggccagcgt gtacaacacg cggcacgtgt gcggcttcgt gtggggtggc 600
gcgctgctga ccagcttctc ctcgctgctc ttctacatct gcagccatgt gtccacccgc 660
gcgctagagt gcgccaagat gcagaacgca gaagctgccg acgccacgct ggtgttcatc 720
ggctacgtgg tgccagcact ggccaccctc tacgcgctgg tgctactctc ccgcgtccgc 780
agggaggaca cgcccctgga ccgggacacg ggccggctgg agccctcggc acacaggctg 840
ctggtggcca ccgtgtgcac gcagtttggg ctctggacgc cacactatct gatcctgctg 900
gggcacacgg gcatcatctc gcgagggaag cccgtggacg cacactacct ggggctactg 960
cactttgtga aggatttctc caaactcctg gccttctcca gcagctttgt gacaccactt 1020
ctctaccgct acatgaacca gagcttcccc agcaagctcc aacggctgat gaaaaagctg 1080
ccctgcgggg accggcactg ctccccggac cacatggggg tgcagcaggt gctggcgtag 1140
gcggcccagc cctcctgggg agacgtgact ctggtggacg cagagcactt agttaccctg 1200
gacgctcccc acatccttcc agaaggagac gagctgctgg aagagaagca ggaggggtgt 1260
ttttcttgaa gtttcctttt tcccacaaat gccactcttg ggccaaggct gtggtccccg 1320
tggctggcat ctggcttgag tctccccgag gcctgtgcgt ctcccaaaca cgcagctcaa 1380
ggtccacatc cgcaaaagcc tcctcgcctt cagcctcctc agcattcagt ttgtcaatga 1440
agtgatgaaa gcttagagcc agtatttata ctttgtggtt aaaatacttg attccccctt 1500
gtttgtttta caaaaacaga tgtttcctag aaaaatgaca aatagtaaaa tgaacaaaac 1560
cctacgaaag aatggcaaca gccagggtgg ccgggccctg ccagtgggcg gcgtgtgcta 1620
gcaaggcctg ccgggtgtgc cgcagtcacc acagggttct gagaacattt cacagaagtg 1680
cctgagacgc ggagacatgg ctggtgttaa atggagctat tcaatagcag tgacgcgctc 1740
tcctcagcca ccaaatgtcc ctgacaccct ccccagcccc cacagataac atcagctgag 1800
gtttttttca gtatgaacct gtcctaaatc aattcctcaa agtgtgcaca aaactaaaga 1860
atataaataa acagaagc 1878



<210> 10 

<211> 1804 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2488822CB1 

<400> 10 

taagtgttaa ctaaaagcat tttattaaat tgtccttcac agaaactcaa tttattaaac 60
catgtataat acatgttcct ttgattgatt attaatttga tatttttagc agcctagaag 120
ggattgaaat ttcaaatatc caacaaagga tgtttagacc tcttgtgaat ctctctcaca 180
tatattttaa gaaattccag tactgtgggt atgcaccaca tgttcgcagc tgtaaaccaa 240
acactgatgg aatttcatct ctagagaatc tcttggcaag cattattcag agagtatttg 300
tctgggttgt atctgcagtt acctgctttg gaaacatttt tgtcatttgc atgcgacctt 360
atatcaggtc tgagaacaag ctgtatgcca tgtcaatcat ttctctctgc tgtgccgact 420
gcttaatggg aatatattta ttcgtgatcg gaggctttga cctaaagttt cgtggagaat 480
acaataagca tgcgcagctg tggatggaga gtactcattg tcagcttgta ggatctttgg 540
ccattctgtc cacagaagta tcagttttac tgttaacatt tctgacattg gaaaaataca 600
tctgcattgt ctatcctttt agatgtgtga gacctggaaa atgcagaaca attacagttc 660
tgattctcat ttggattact ggttttatag tggctttcat tccattgagc aataaggaat 720
ttttcaaaaa ctactatgca cccaatggag tatgcttccc tcttcattca gaagatacag 780
aaagtattgg agcccagatt tattcagtgg caatttttct tggtattaat ttggccgcat 840
ttatcatcat agttttttcc tatggaagca tgttttatag tgttcatcaa agtgccataa 900
cagcaactga aatacggaat caagttaaaa aagagatgat ccttgccaaa cgttttttct 960
ttatagtatt tactgatgca ttatgctgga tacccatttt tgtagtgaaa tttctttcac 1020
tgcttcaggt agaaatacca ggtaccataa cctcttgggt agtgattttt attctgccca 1080
ttaacagtgc tttgaaccca attctctata ctctgaccac aagaccattt aaagaaatga 1140
ttcatcggtt ttggtataac tacagacaaa gaaaatctat ggacagcaaa ggtcagaaaa 1200
catatgctcc atcattcatc tgggtggaaa tgtggccact gcaggagatg ccacctgagt 1260
taatgaagcc ggaccttttc acatacccct gtgaaatgtc actgatttct caatcaacga 1320
gactcaattc ctattcatga ctgactctga aattcatttc ttcgcagaga atactgtggg 1380
ggtgcttcat gagggattta ctggtatgaa atgaatacca caaaattaat ttataataat 1440
agctaagata aatattttac aaggacatga ggaaaaataa aaatgactaa tgctcttaca 1500
aagggaagta attatatcaa taatgtatat atattagtag acattttgca taagaaatta 1560
agagaaatct acttcagtaa cattcattca tttttctaac atgcatttat tgagtaccca 1620
ctactatgtg catagcattg caatatagtc ctggaagtag acagtgcaga acctttcaat 1680
ctgtagatgg tgtttaatga caaaagacta tacaaagtcc atctgcagtt cctagtttaa 1740
agtagagctt tacctgtcat gtgcatcagc aagaatcata gcgattttaa atagaggtgt 1800
ggac 1804



<210> 11 

<211> 1520 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2705201CB1 



<400> 11 

tgccgaagag tctggagcgt cggcgctgcg gggccgcggg ggtcgaatgt tcgtggcatc 60
agagagaaag atgagagctc accaggtgct caccttcctc ctgctcttcg tgatcacctc 120
ggtggcctct gaaaacgcca gcacatcccg aggctgtggg ctggacctcc tccctcagta 180
cgtgtccctg tgcgacctgg acgccatctg gggcattgtg gtggaggcgg tggccggggc 240
gggcgccctg atcacactgc tcctgatgct catcctcctg gtgcggctgc ccttcatcaa 300
ggagaaggag aagaagagcc ctgtgggcct ccactttctg ttcctcctgg ggaccctggg 360
cctctttggg ctgacgtttg ccttcatcat ccaggaggac gagaccatct gctctgtccg 420
ccgcttcctc tggggcgtcc tctttgcgct ctgcttctcc tgcctgctga gccaggcatg 480
gcgcgtgcgg aggctggtgc ggcatggcac gggccccgcg ggctggcagc tggtgggcct 540
ggcgctgtgc ctgatgctgg tgcaagtcat catcgctgtg gagtggctgg tgctcaccgt 600
gctgcgtgac acaaggccag cctgcgccta cgagcccatg gactttgtga tggccctcat 660
ctacgacatg gtactgcttg tggtcaccct ggggctggcc ctcttcactc tgtgcggcaa 720
gttcaagagg tggaagctga acggggcctt cctcctcatc acagccttcc tctctgtgct 780
catctgggtg gcctggatga ccatgtacct cttcggcaat gtcaagctgc agcaggggga 840
tgcctggaac gaccccacct tggccatcac gctggcggcc agcggctggg tcttcgtcat 900
cttccacgcc atccctgaga tccactgcac ccttctgcca gccctgcagg agaacacgcc 960
caactacttc gacacgtcgc agcccaggat gcgggagacg gccttcgagg aggacgtgca 1020
gctgccgcgg gcctatatgg agaacaaggc cttctccatg gatgaacaca atgcagctct 1080
ccgaacagca ggatttccca acggcagctt gggaaaaaga cccagtggca gcttggggaa 1140
aagacccagc gctccgttta gaagcaacgt gtatcagcca actgagatgg ccgtcgtgct 1200
caacggtggg accatcccaa ctgctccgcc aagtcacaca ggaagacacc tttggtgaaa 1260
gactttaagt tccagagaat cagaatttct cttaccgatt tgcctccctg gctgtgtctt 1320
tcttgaggga gaaatcggta acagttgccg aaccaggccg cctcacagcc aggaaatttg 1380
gaaatcctag ccaaggggat ttcgtgtaaa tgtgaacact gacgaactga aaagctaaca 1440
ccgactgccc gcccctcccc tgccacacac acagacacgt aataccagac caacctcaat 1500
ccccacctta aaaaaaaaaa 1520

<210> 12
<211> 2919
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 3036563CB1 

<400> 12 

atcttgatgg agcagaatca gtactgacag tcaagacctc gaccagggag tggaatggga 60
acgtatcact gcatatttag atataagaat tcatacagta ttgcaaccaa agacgtcatt 120
gttcacccgc tgcctctaaa gctgaacatc atggttgatc ctttggaagc tactgtttca 180
tgcagtggtt cccatcacat caagtgctgc atagaggagg atggagacta caaagttact 240
ttccatatgg gttcctcatc ccttcctgct gcaaaagaag ttaacaaaaa acaagtgtgc 300
tacaaacaca atttcaatgc aagctcagtt tcctggtgtt caaaaactgt tgatgtgtgt 360
tgtcacttta ccaatgctgc taataattca gtctggagcc catctatgaa gctgaatctg 420
gttcctgggg aaaacatcac atgccaggat cccgtaatag gtgtcggaga gccggggaaa 480
gtcatccaga agctatgccg gttctcaaac gttcccagca gccctgagag tcccattggc 540
gggaccatca cttacaaatg tgtaggctcc cagtgggagg agaagagaaa tgactgcatc 600
tctgccccaa taaacagtct gctccagatg gctaaggctt tgatcaagag cccctctcag 660
gatgagatgc tccctacata cctgaaggat ctttctatta gcataggcaa agcggaacat 720
gaaatcagct cttctcctgg gagtctggga gccattatta acatccttga tctgctctca 780
acagttccaa cccaagtaaa ttcagaaatg atgacgcacg tgctctctac ggttaatatc 840
atccttggca agcccgtctt gaacacctgg aaggttttac aacagcaatg gaccaatcag 900
agttcacagc tactacattc agtggaaaga ttttcccaag cattacagtc aggagatagc 960
cctccattgt ccttctccca aactaatgtg cagatgagca gcatggtaat caagtccagc 1020
cacccagaaa cctatcaaca gaggtttgtt ttcccatact ttgacctctg gggcaatgtg 1080
gtcattgaca agagctacct agaaaacttg cagtcggatt cgtctattgt caccatggct 1140
ttcccaactc tccaagccat ccttgctcag gatatccagg aaaataactt tgcagagagc 1200
ttagtgatga caaccactgt cagccacaat acgactatgc cattcaggat ttcaatgact 1260
tttaagaaca atagcccttc aggcggcgaa acgaagtgtg tcttctggaa cttcaggctt 1320
gccaacaaca caggggggtg ggacagcagt gggtgctatg ttgaagaagg tgatggggac 1380
aatgtcacct gtatctgtga ccacctaaca tcattctcca tcctcatgtc ccctgactcc 1440
ccagatccta gttctctcct gggaatactc ctggatatta tttcttatgt tggggtgggc 1500
ttttccatct tgagcttggc agcctgtcta gttgtggaag ctgtggtgtg gaaatcggtg 1560
accaagaatc ggacttctta tatgcgccac acctgcatag tgaatatcgc tgcctccctt 1620
ctggtcgcca acacctggtt cattgtggtc gctgccatcc aggacaatcg ctacatactc 1680
tgcaagacag cctgtgtggc tgccaccttc ttcatccact tcttctacct cagcgtcttc 1740
ttctggatgc tgacactggg cctcatgctg ttctatcgcc tggttttcat tctgcatgaa 1800
acaagcaggt ccactcagaa agccattgcc ttctgtcttg gctatggctg cccacttgcc 1860
atctcggtca tcacgctggg agccacccag ccccgggaag tctatacgag gaagaatgtc 1920
tgttggctca actgggagga caccaaggcc ctgctggctt tcgccatccc agcactgatc 1980
attgtggtgg tgaacataac catcactatt gtggtcatca ccaagatcct gaggccttcc 2040
attggagaca agccatgcaa gcaggagaag agcagcctgt ttcagatcag caagagcatt 2100
ggggtcctca caccactctt gggcctcact tggggttttg gtctcaccac tgtgttccca 2160
gggaccaacc ttgtgttcca tatcatattt gccatcctca atgtcttcca gggattattc 2220
attttactct ttggatgcct ctgggatctg aaggtacagg aagctttgct gaataagttt 2280
tcattgtcga gatggtcttc acagcactca aagtcaacat ccctgggttc atccacacct 2340
gtgttttcta tgagttctcc aatatcaagg agatttaaca atttgtttgg taaaacagga 2400
acgtataatg tttccacccc agaagcaacc agctcatccc tggaaaactc atccagtgct 2460
tcttcgttgc tcaactaaga acaggataat ccaacctacg tgacctcccg gggacagtgg 2520
ctgtgctttt aaaaagagat gcttgcaaag caatggggaa cgtgttctcg gggcaggttt 2580
ccgggagcag atgccaaaaa gactttttca tagagaagag gctttctttt gtaaagacag 2640
aataaaaata attgttatgt ttctgtttgt tccctccccc tcccccttgt gtgataccac 2700
atgtgtatag tatttaagtg aaactcaagc cctcaaggcc caacttctct gtctatattg 2760
taatatagaa tttcgaagag acattttcac tttttacaca ttgggcacaa agataagctt 2820
tgattaaagt agtaagtaaa aggctaccta ggaaatactt cagtgaattc taagaaggaa 2880
ggaaggaagg aaggagggaa agaagggagg aaaccagga 2919